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F oreword 


The Western Joint Computer Conference returned to San Francisco, Calif., for the second 
time in 1959. The dynamic growth in our industry and the computer industry growth in the 
San Francisco area is evidenced by the doubling of attendance since 1956. 

The 1959 Western Joint Computer Conference presented a view of “New Horizons with 
Computer Technology.” These new horizons are appearing with the new circuits and devices 
that are arising from the maturing research efforts of our growing businesses. Similarly, new 
vistas are opening as a result of the knowledge gained through the application and utilization 
of computers throughout our entire economy. 

These annual conferences are intended to provide an opportunity for the professional 
people in the computer industry to discuss their technical and business interests and accom- 
plishments. The papers herein present the record of these discussions. 

R. R. Johnson 

Conference Chairman 
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New Horizons in Systems 

DARWIN E. ELLETTf 


I T HAS been our experience in the Air Materiel Com- 
mand, and I imagine this is true of everyone en- 
gaged in data systems modernization and automa- 
tion, that the further we advance, the greater potential 
we find for achievement. A prime characteristic of our 
objectives would appear to be mobility, for as we move 
closer to each established milestone, we inevitably find 
a new horizon beyond. So, in keeping with my subject, 
I propose to review some suggested issues and oppor- 
tunities for future exploration and development. 

The field of data systems design and automation is at 
once broad and complex, with its many interlocking 
facets of concept, principle, methodology, equipping, 
and technique, and its deep penetration into every 
phase of management and operational activities. 
Further, it is a very dynamic field. There are interacting 
advancements afoot in all of these facets and all of these 
phases, and there is, therefore, a wide range of new hor- 
izons available for examination. 

In the light of this situation, my comments neces- 
sarily follow from a process of selection and limitation. 
I have chosen those areas which appear to us to be of the 
broadest application and the most basic concern. 

As a first order of limitation, I shall confine my re- 
marks to management systems from the viewpoint of 
the Air Materiel Command. I shall further limit the 
subject to a discussion of four steps in the management 
process: 

1) The expression of the mission of the Command. 

2) The establishment of the work processes for the 
accomplishment of this mission. 

3) The establishment and organization of the man- 
agement rules associated with these work proc- 
esses. 

4) The translation of these rules, where appropriate, 
into the procedural detail required for machine 
application. 

Although I am basing my remarks on our experiences 
in the Air Materiel Command, I think it is safe to say 
that the fundamental management decisions are sub- 
stantially the same in all enterprises. Therefore, the 
issues upon which I shall build my case are offered as be- 
ing of possible general application. In this connection, 
since I am basically describing a possible course for our 
own future efforts, my intent is simply to add whatever 
stimulus I may to your thinking, rather than to pro- 
pound ready-made answers. 

The immediate reaction to my selection of specific 
topics may be that I am outside the lawful hunting 

f Colonel, USAF ; Chief, Data Systems Plans Div., Air Materiel 
Command, Wright-Patterson AFB, Ohio. 


ground of the data systems designer. Traditionally, this 
is true. I think most of us work under written or im- 
plicitly accepted legislation which decrees that the estab- 
lishment and ordering of work processes is a separate 
task, and that management retains the prerogative of 
deciding and specifying how it shall manage. 

Let it be understood at once that I am not about to 
make a case for major adjustment in the organizational 
chart, with the data systems planner to come into a new 
ascendancy. Rather, I am making the proposition that 
we cannot do all that we should, if in our data systems 
work we go no deeper than a reappraisal and updating of 
accounting and computational methodology, for this 
level is to a large degree no more than a reflection of the 
basic work flow itself. 

In developing a case for positive action, I should like 
to begin with a brief review of our own data systems 
modernization program. Since early 1954, we have been 
vigorously pursuing a command-wide program for mod- 
ernizing data systems through the use of electronic data 
processing and modern communications. Our primary 
aim has been data systems integration. I shall not at- 
tempt to define integration to the satisfaction of every- 
one, but I should like to devote a moment or two to 
pointing up what such an objective has meant to our 
program. Simply stated, it has meant that our program 
could be no less in scope than the command itself. While 
we do pursue a policy of extensive decentralization, all 
of our activities and operating facilities are themselves 
completely interdependent; they are as indivisible as 
airpower itself. 

Let me further enlarge on this point by a brief de- 
scription of our specific management responsibilities and 
relationships. 

The Air Materiel Command is the agency responsible 
for materiel support of all Air Force organizations and 
activities in accordance with established Air Force plans 
and programs. This involves: 

1) Managing the production and delivery into active 
service of the air weapons themselves. 

2) Determination of requirements and control of 
world-wide distribution of the million and a half 
line items of spare assemblies, parts, and support- 
ing equipment and supplies in today’s inventories. 

3) Operation of a formidable array of industrial fa- 
cilities which must physically accomplish the pro- 
curement, storage, transportation, and repair of 
these items. 

In the execution of these responsibilities, we do not 
have a situation, as have many industrial concerns, 
where each operating division, or plant, can be perma- 
nently identified with a separate product, or range of 
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products, and can therefore engage in separate forecast- 
ing, workload planning, budgeting, production, and dis- 
tribution. To the contrary, it is in the nature of the Air 
Force mission, and the highly dynamic state of research 
and development, that air weapons will constantly enter 
and leave the active inventory, and during their life 
cycles, will enjoy different relative precedences at dif- 
ferent points in time. All command resources, in funds, 
materiel of common usage, and facilities, must be ap- 
plied in the manner most effective for appropriate sup- 
port of all weapons, in accordance with the current 
combat force structure, and the precedences of units 
and weapons within this structure. 

This condition brings to bear two severe criteria for 
the data systems planner: 

1) In the light of the very dynamic operational and 
technological environment, there must be a very high 
capability for rapid and coordinated reaction and ad- 
justment down through all of the levels of materiel sup- 
port programming and program execution, from weapon 
production, through scheduling of spares acquisition 
and allocation, to workloading of the industrial facil- 
ities. 

2) The managers of all components of the command, 
however organized and wherever located within our 
fourteen major management centers across the country, 
are jointly engaged in managing a common enterprise 
in support of a common workload. This means that 
there is complete interdependency of management deci- 
sion and action, and therefore interdependency for 
management information. Thus, we have, in fact, a 
single management process and the need for a single, 
all-inclusive data system. 

With an awareness of this need, our data systems de- 
velopment program has moved out on this general 
basis : 

1) We have defined and published this objective of 
systems integration, and the essential detail of planning 
criteria it engenders. 

2) We have initiated and are well along on a systems 
design and application program which provides for the 
identification and separate modernization of data sub- 
systems. 

There are about 30 of these subsystems within our 
total data system. Examples range from the computa- 
tion of requirements for aircraft spare parts, to inven- 
tory control of ground vehicles, to cost accounting and 
labor distribution in the maintenance shops. The objec- 
tive of command-wide integration implies the need for 
command-wide procedural standardization, and our de- 
velopment program is geared to this end. A formal de- 
velopment project has been established for each sub- 
system. Each such project is guided by a Headquarters 
steering group made up of representatives of all offices 
of management or systems interest and monitored by a 
representative of the Data Systems Plans Division. 
Since our materiel operation is decentralized, most of 
the projects cover depot-level applications. Therefore, 


much of the procedures design work, and the machine 
programming, test, and initial computer application are 
assigned to a “pilot” Air Force Depot in each case. 
When this work is completed, all other Air Force Depots 
concerned implement the standard product as de- 
veloped. (Air Materiel Areas and Air Force Depots are 
equivalents for our purposes here. Each of these ac- 
tivities holds world-wide materiel management responsi- 
bility for an assigned segment of the total inventory. In 
addition, each is the site of a part of the industrial fa- 
cility complex, and as such engages in materiel process- 
ing, that being primarily major repair and wholesale 
storage, for the various materiel management elements.) 

As one can see, under this approach we are taking the 
data system apart, updating its components, and put- 
ting it back together again in a new composite. We are 
doing this through central control against an end of 
integration, and through decentralized but standard 
modernization of components. 

We can report that this approach works, and it works 
rather well if the objective is to make rapid improve- 
ments in data handling. However, it will not quickly 
achieve integrated systems, as will become evident in 
this discussion. We have been attacking the subsystems 
on a “pay-off-as-we-go” basis, in an order of priority 
which affords the greatest rate of improvement in our 
combat support capability and in resources manage- 
ment. Many of our priority efforts have reached the op- 
erational stage, with acceptable results; others are 
nearly there. 

About a dozen large-scale and 30-odd small- and 
medium-scale computers are in active use at Head- 
quarters and at our Air Materiel Areas and Air Force 
Depots. We have in being a world-wide punch-card 
transceiver net which links these centers and most of the 
major Air Force Bases, and we are well along toward 
significant further improvement in our communication 
systems. 

Beyond these immediate data systems improvements, 
there has been increasing awareness of the need and op- 
portunity for greater management integration. In recent 
months, the command has undergone considerable ad- 
justment in management organization, moving into an 
aggregation of responsibilities which parallels closely 
the present array of decision points and resulting data 
flow. 

While our data system work has been by no means 
the sole (and quite possibly not even the primary) cause 
of this reorganization, it is safe to say that it has con- 
tributed an important and positive influence. At the 
same time, and notwithstanding these significant 
achievements, if we are to attain and retain the ultimate 
in an integrated system, there is much yet to be done, 
and this brings us back to the further efforts I defined at 
the beginning of this discussion. 

Before I move into a brief review of these issues, I 
should like to point out that much of what I shall say is 
based primarily on personal thoughts and observations, 
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and should not be regarded as a chronicle of what the 
Air Force now officially proposes to do. I am simply pro- 
posing to share with you, as fellow workers in the sys- 
tems field, some of the ideas that have evolved in at- 
tempting to project my thinking beyond today’s state- 
of-the-art and potential achievement in the near term. 

Let us begin with my first step, “The expression of 
the mission of the Command.” As presently stated, the 
Air Materiel Command does have an abstract single 
ultimate objective. In a very simplified version, it is: 
To produce and deliver air weapons and thereafter to 
support them through their life cycles, in accordance 
with Air Force plans and programs and priorities, and 
as indicated earlier, we have a single integrated enter- 
prise to accomplish this mission. Although it has a 
multiplicity of functions, and a complex far-flung or- 
ganization, it is nevertheless of single purpose and uni- 
fied by nature. Now, in the modernization of data sys- 
tems over the past few years, it has become increasingly 
evident that if the many decision points in this total 
system are to be equated, then the mission itself, to 
which they should all relate, must be stated quantita- 
tively. There have, of course, been many advancements 
over the years toward a more precise definition of the 
end product of the work of the Command. For example, 
we have used and continue to use a desired in commis- 
sion rate for each of the various types of aircraft and 
missiles in the inventory. And to some extent, we have 
used a desired percentage of positive supply action as 
an objective of the enterprise. We use and have used an 
array of other objectives, such as a desired percentage of 
the aircraft and missiles in commission that are equipped 
to perform their wartime mission, or the sortie capabil- 
ity desired at a given point in time by various units, 
with an expression of the quantitative materiel require- 
ments at a particular site to achieve this. None of these 
objectives or definitions of the products of the Com- 
mand are entirely satisfactory when viewed from the 
standpoint of the data system designer, who is attempt- 
ing to relate appropriately all decision points to each 
other and to the mission of the organization. This is so 
because the objective has never been defined as a single 
all-inclusive quantified expression. I realize that to 
achieve this is a tremendous undertaking, but without 
it, no final evaluation of all of the work processes as to 
their essentiality and quality can be achieved nor can 
the best supporting automated data system be attained. 
Our systems people are not now actively working in this 
particular area, but we realize that to do so will become 
increasingly more important with the passage of time. 

Next, the establishment and description of the physi- 
cal work processes necessary to the accomplishment of 
this mission. By referring again to our single ultimate 
purpose, and the unity of our enterprise, we come to this 
conclusion: The entire operation can be considered in 
total as a world-wide production line, composed of an 
unbroken series of work steps, from original input of Air 
Force programs to final delivery of the last item of ma- 


teriel to the using forces. These are the action steps 
which march through all phases of planning, program- 
ming, and budgeting, and the direction, execution, and 
evaluation of results of these programs. And although 
there is a positive interrelationship between the work 
steps and the rules associated with them, I believe the 
general pattern of work must be planned before setting 
down the rules by which it will be directed and reported. 

I do not know how many such steps would be involved 
in the typical situation, nor do I know how many there 
are in our own system, but obviously there are a consid- 
erable number, possibly several thousand. In whatever 
event, a description of these steps, in their logical se- 
quence and without regard to current organization or 
division of responsibility, is a delineation of the live sys- 
tem itself, and is therefore the foundation for the most 
effective data system. Since the advent of the electronic 
computer, and of course even before that, we have all 
given much attention to the data needs of management, 
and have been much concerned with meeting these re- 
quirements. I am now suggesting that there is a more 
basic task at hand, if we are to explore the full potential 
of today’s equipment and techniques. We must equip 
ourselves with a systematic view of the total work proc- 
ess in which the data system is grounded, which is at 
once the source of all its inputs and the destination of 
all outputs. For the relatively small and simple enter- 
prise, operating at one location, this may be no great 
problem ; all of the elements may literally be already in 
sight. However, for the large-scale activity, which has 
expanded with the industrial revolution and has tended 
to grow up in segments, through the necessary multipli- 
cation of management, this is no small task. 

I can perhaps elaborate a bit on the true extent of 
such an effort, and further establish ou,r need for this 
systematic view, by expanding the discussion of a work 
step. We find that in the operating system, the work re- 
quired is a mixture of data processing and materiel proc- 
essing, interwoven and interlocked. In the early stages 
of the cycle, particularly those involved in planning, 
programming, and budgeting, only data processing is 
involved. However, beyond the first point at which ma- 
teriel is committed to manufacture, there is an interde- 
pendent blend of materiel processing and data process- 
ing in terms of work flow. From this observation, we 
have drawn the conclusion that data processing is so 
closely related to the physical work process that it is 
inseparable of design if the ultimate in integration is to 
be achieved. This alone is sufficient reason for a view 
which includes all work steps of whatever specific na- 
ture. 

And this is by no means the whole story. If we follow 
this logic a bit further, we come to a rather startling 
revelation. While we have said that data processing is an 
integral part of the total work process, we are still con- 
fronted with the fact that the basic job is to manufac- 
ture, deliver, store, repair, use, and dispose of materiel. 
To go back to one of our earliest axioms: data processing 
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is not an end in itself, paper can never be the final prod- 
uct. This means that data processing serves only to pro- 
vide a control and a report; a control to direct a physi- 
cal action with reference to the materiel itself, and a re- 
port of action accomplished which in turn leads to direc- 
tion of the next succeeding materiel work step, another 
action, another report, and so on throughout the entire 
system. Of course, there is a very definite layering effect 
in data processing. The execution of one data step may 
lead only to another, down through any number, but 
the whole series must eventually emerge into direction 
of a materiel processing action in our business or it has 
no purpose. 

If all of this is so, then it must surely follow that much 
of the complexity and volume of data processing must be 
charged to the number of separate control directions to 
be given and reports to be rendered, and the extent of 
their repetition. 

Therefore, can we not postulate from this that the 
road to data systems simplification must finally lead 
through simplification of the materiel work process? 
The simpler and better organized the ultimate job, the 
simpler the control and reporting system. 

Beyond this, lies factory automation, which I think 
most of us have historically regarded as a not-too-fa- 
miliar cousin. As the physical work itself is automated in 
longer and longer runs, there must be a corresponding 
decrease in the number of points at which individual 
energizing instructions must be entered from the central 
management data system. It should be possible in the 
foreseeable future to introduce original instructions at 
point of entry of materiel into the automated production 
run at hand, and thereupon, to pass control to the local 
automatic governing mechanism, with no need for re- 
port or new instruction until the materiel is delivered at 
the other end of the line. The longer and, therefore, the 
fewer number of separate production runs, the lesser de- 
mands upon the central management data system. 
Here again a case for reviewing the steps involved in 
both data processing and materiel processing as a 
single entity, with improvement in one basic to per- 
formance in the other. 

Our case can be extended through examination of one 
final issue in this cycle of events. By automation of the 
materiel production process in long sequences, we do, as 
I have indicated, reduce the need for step-by-step con- 
trol from the central management data system. But the 
fact remains that this control must go on. We have 
simply transferred it to the local automatic control de- 
vice for the production run in question. Now, two facts 
become self-evident. First, this local control device is 
itself by nature a data processor, moving through a 
series of orders and counts, which are quantitative of 
expression, and second, it is energized by the input of 
production schedules which -can only come from the 
central management data system. Again, the ordering 
and processing of the work itself is basic to a correct 
design of the data system, and automation of materiel 


production is by definition to a large degree automa- 
tion of management data processing. 

In summary on this point, we come to the inevitable 
conclusion that we must eventually cease to view the 
data system as something apart; we must step up to a 
plane from which we can attack the entire operating 
system as an inseparable entity. 

I should like to turn now to my third point: the es- 
tablishment and organization of management rules 
associated with the work processes. This subject reflects 
to a considerable degree the same issues as those we 
found in reviewing the subject of work steps, for a man- 
agement rule as I am using the term is basically an ex- 
tension of the statement of a work step to include the 
conditions, actions, and exceptions which will apply 
under each set of possible circumstances. For each work 
step there must be governing management rules, and in 
composite these steps and their rules are a procedural 
extension of management objectives, plans, and policies. 
The incidence of management rules will, of course, de- 
pend entirely upon the incidence of variable conditions. 
In some situations, a standard and unvarying condition 
will exist and the only management rule needed is an im- 
plied, “Do this work step.” Such work steps are most fre- 
quently found in straight production-line situations 
where there is no choice and therefore no local decision. 

On the other hand, there are of course many situa- 
tions where multiple alternatives present themselves. 
Since each decision will have impact on all of the steps 
that follow, and, therefore, on the functioning of the en- 
tire system, it is absolutely essential that all possible 
selections must be carefully established and described 
within a frame of reference which can be no less than the 
total operation. 

I think I can develop this point and perhaps point up 
an interesting area for exploration by categorizing these 
management rules in two groups. 

The first group would include all rules which are made 
necessary by the nature of the enterprise itself. It would 
hardly be realistic to assume that any large-scale ac- 
tivity could be straight-lined from original plan and 
policy to final accomplishment, with absolutely no need 
for intermediate decision and selection of alternatives 
along the way. There are too many variables and un- 
predictables in the surrounding environment alone to 
permit such a condition. I believe I can demonstrate 
this rather obvious fact by a simple example, in terms 
of our own Air Force materiel support, and in terms of 
work steps and management rules which are a part of 
data processing. 

In the course of our accomplishment of materiel sup- 
port, one of the major sequential work steps might be, 
“Distribute this commodity from storage point ‘A’ to 
customer ‘B’.” Obviously, such a statement immedi- 
ately begets subordinate steps. One might be, “Process 
the customer’s requisition.” Now, as you can readily see, 
we have stepped off into the field of data systems, for a 
good part of processing a requisition is data processing. 
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We now come therefore to work steps and management 
rules which are of basic importance to the data system 
designer. One such work step might be, “Compare quan- 
tity requested with inventory balance.” Here the man- 
agement rules set in. For example, if the balance is ade- 
quate, take one action. If there is stock, but this order 
brings the balance below minimum levels for retention, 
take another. If there is no stock, there is a third type 
of action involved. No matter how precisely future sys- 
tems may permit the planning of requirements, unpre- 
dictable events in manufacture, delivery, or usage may 
operate to cause this condition. Therefore, if automation 
is to occur, management must make rules in advance to 
cover all situations that may arise due to the relation- 
ships of the enterprise to its surrounding environment, 
and management must assume responsibility for failure 
in this area. 

The second group would include what we might call 
deficit rules ; those made necessary by faults in design of 
the system itself. 

In a rather obvious example, one element of the 
data system might prescribe a work flow which termi- 
nates in the rendition of reports from a number of sim- 
ilar activities. Another element of the system must re- 
ceive and act upon these reports. If these reports are in 
standard format and content from all reporting agen- 
cies, and as needed by the receiving activity, then it is 
possible to proceed immediately to the next work step 
of operational value. But, if reports are not consistent, 
there is a need to provide management rules and further 
work steps which cover actions to be taken to discover 
and eliminate the various discrepancies. This might in- 
volve returning incorrect or incomplete reports to the 
sender, and suspending the remainder. It might involve 
making assumptions which would permit the process to 
continue. In any event, it causes the establishment and 
building into the system of a set of management rules 
which serve no more than a deficiency purpose, and for 
which responsibility must rest with the systems designer 
himself. 

If we design and create the system in segments, there 
is every reason to believe that with due care we shall 
eliminate the internal need for such deficit rules within 
each segment. However, with anything less than per- 
fect coordination of detail and rate of progress among 
those designing the various segments, such deficiencies 
will inevitably occur. The direct result: inefficiencies in 
the segments, cumulative inefficiencies in the entire sys- 
tem, and finally inefficiency and loss of effectiveness and 
economy in meeting the basic mission. I believe that at 
this point we can add this review of management rules 
to our earlier observations on work steps, and draw a 
rather obvious conclusion: to secure the most effective 
results, we must design and bring into being a system 
which extends from original input programs to final 
product delivery in an unbroken series of work steps and 
governing management rules, with physical materiel 
processing and data processing steps and rules, and 


their automation, regarded as inseparable and com- 
pletely interdependent. The composition of such a 
system must be limited entirely to rules and steps 
which actually advance the work at hand; and as a 
first order of business, the system deficiencies must be 
sought out and eliminated. This latter statement raises 
a further issue: the efficient organization of these work 
steps and rules. We have just observed that steps and 
rules entered by the systems designer as a deficiency ac- 
tion must be eliminated. This leaves the province of 
work step establishment and type one rule formulation 
entirely to management, and this is as it should be. 
However, once this has been accomplished, the system 
designer must step forward and assume responsibility 
for their most effective organization. This division of 
effort has always been true of manual methods; it is 
even more pertinent as we move deeper into process 
automation. This organization comes to final fruition in 
our detailed procedural flow and our computer routines, 
and the outer limits of effective automation will depend 
upon our ability to absorb and consolidate these rou- 
tines in larger and larger composites. 

Thus, we find ourselves in a situation where we must 
pursue an objective of attaining and maintaining com- 
plete systems integration, accompanied by maximum ef- 
fective automation. We must ask all of the many ele- 
ments of management to establish work steps and formu- 
late rules in the light of all others, and we must ask our 
systems designers to organize all of these steps for the 
most effective execution. 

Given this statement of need, we must now cast about 
for a control vehicle through which we can approach this 
task, and here we come upon a familiar subject. I refer 
to the various directive and procedural manuals and 
similar publications which are an integral part of any 
management system, in which all of these work steps 
and rules are traditionally expressed, and which must 
continue to serve in one form or another as our docu- 
ments of reference. 

At this point, we are confronted with very real con- 
siderations which are at once friend and foe: the Eng- 
lish language, and the human mind. 

You are all well aware of the limitations of the Eng- 
lish language as a device for definitive and precise com- 
munication in procedural publications. If this is true in 
a limited area, consider the problems involved in at- 
tempting to formulate, correlate, and record with pre- 
cision, all work steps and management rules for the 
whole of a large-scale system. And consider further the 
very limited ability of the human mind to comprehend, 
analyze, correlate, and evaluate all of these management 
rules under such conditions. 

Fortunately, we believe that there is the very crude 
beginning of a solution at hand, one which requires a 
great deal of imaginative and painstaking effort, but 
which offers great potential. This is central and standard 
control of the body of publication knowledge. I am 
sure that this approach has occurred to others in the 
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field of data system design, probably including at least 
some of you here today. 

One of our consulting firms has just completed con- 
siderable initial research in this area, utilizing some of 
our existing publications for analysis. The results indi- 
cate that while this is certainly no simple task, it is en- 
tirely possible, and we are now preparing to extend our 
efforts into further research and probably later into a 
major attack on some of our more critical areas. Spe- 
cifically those which are identified with our priority 
data subsystem development projects. 

This attack is guided by four basic principles: 

Principle No. 1 — Work steps and management rules 
can be removed from the usual narrative form and ex- 
pressed as independent entities, often in tabular form, 
and without consideration of data systems problem or- 
ganization or the type of processing equipment to be 
employed. This principle in itself will work to bring 
greater precision to the formulation and application of 
management rules. Further, and most important, such 
an expression will serve to close partially the language 
gap between management and data systems designers. 
There is some relief from the need for joint logical flow- 
charting; management, as the creator of steps and rules, 
need not concern itself directly with the most efficient 
organization of rules for machine processing. 

Principle No. 2 — Techniques are required which will 
permit the efficient processing, correlation, and control 
of the total body of publication knowledge, as a basis 
for coordinated formulation of all work steps and man- 
agement rules. This can be accomplished through the 
numerical coding of each English language information 
element, set, and system as they are used in procedural 
publications to describe these steps and rules. This per- 
mits the correlation of English words and phrases which 
are dissimilar of expression but have common meaning, 
and their reduction to specific and standard terms, 
which are in turn susceptible to consolidated filing and 
processing. 

Principle No. 3 — There can be established a central 
information repository containing all steps and manage- 
ment rules, expressed in terms of standard elements and 
sets of information, and indexed to all applicable com- 
ponents of the system, and to related publications. This 
library would serve as a control filter between manage- 
ment policy and the data system, by providing evalua- 
tion of the total impact of a proposed change to any rule 
or information element, a must in the interests of sus- 
tained integration. 

Principle No. 4 — Such a file is itself a very likely can- 
didate for automation. This assumes a rather extensive 
file, and the need for frequent reference, and it has been 
our experience that our own system would certainly 
meet these criteria. 

In summary to this point, we have proposed a quan- 
titative expression of the objective, the restating of all 
contributing work steps, and management rules, in the 
most logical and effective sequence across the entire 


system, their conversion to more precise and standard 
terms, and a central file and process which will permit 
management and systems designers to review a change 
to any sequential step or rule in relation to all others, 
an impossible feat in today’s large enterprise, but possi- 
bly not so in the future. 

Finally, we come to our last new horizon for today’s 
consideration: automatic machine programming. This 
subject may seem a bit apart from those I have been 
discussing, but I believe I can establish a close liaison. 
As you all know, programming is a very difficult task re- 
quiring a very scarce skill, and it is one of the major 
limiting factors of our rate of progress in computer ap- 
plication. In the light of this situation, we, like other 
computer users, have moved actively into the field of 
automatic, or general, programming. We define this as 
a technique whereby the computer is instructed in basic 
English verbs, and in turn the machine codes instruc- 
tions for itself when fed appropriately designed data 
to be processed. The criticality of this area has been 
such that we are applying some of our best talent in a 
major effort at improvement. While we are still far from 
final achievement, we have made considerable progress, 
and are quite optimistic about future potential. In addi- 
tion to practical relief from the programming task, 
achievement here has been considered essential to insur- 
ing the necessary degree of data systems standardization 
and integration among our computer centers. 

Beyond these direct systems benefits, let me now 
point up the close relationship between this field and 
those I discussed earlier. The specific situation is this: 
if we can achieve a high degree of direct correlation be- 
tween the input to automatic programming and the 
standard expression of work steps and management 
rules, we will have forged the final link in the chain, and 
will have a highly controlled and responsive means for 
progressing from original management policy and deci- 
sion to procedural implementation. We can then close 
the circuit by providing test decks and other analytical 
means for insuring that the information system has in 
fact produced the results desired by management in 
original decision and directive. 

This, then, is our view of where we must go in the 
future if we are to reap the full benefits inherent in the 
use of advanced equipments and techniques, and are to 
finally achieve the best in management control through 
systems integration and automation. 

In summary and analysis of the issues I have been 
discussing, it occurs to me that the basic substance has 
been this: The future of our systems work holds almost 
unlimited potential for gains in industrial efficiency, and 
that resulting benefits will continue to justify the ex- 
penditure of the required resources to achieve them. 

However, I cannot leave the subject at this. I suggest 
for your sincere consideration that if we as systems 
people proceed solely within this goal of efficiency, we 
are guilty of introspection ; we may miss completely the 
basic point and true objective of all our efforts to im- 
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prove the welfare of human beings. If there is a real rea- 
son for promoting this efficiency, it can be none other 
than the betterment of human living, whether through 
better defense of the nation, improvement of the 
standard of living, relief from manual drudgery, or 
whatever the specific and valid aim. 

In pursuing our goals of greater integration and 
automation by attempting to predict all eventualities 
and to prearrange solutions, we as systems workers, 


must never lose sight of this purpose, lest in our enthu- 
siasm we do harm to our real intent. We must assume 
major responsibility for insuring that every step for- 
ward meets the criterion of even greater human contri- 
bution all up and down the line, and therefore even 
greater human dignity. Progress in systems design and 
automation must always be measured within this larger 
perspective, and must forever be conditioned, and per- 
haps even limited, by this governing need. 


A Multiload Transfluxor Memory 

D. G. HAMMELf, W. L. MORGAN t and R. D. SIDNAMf 


Introduction 

I N the field of computing machinery there is an ever- 
increasing demand for the development of random- 
access digital memory-storage units that operate at 
higher speeds and provide greater storage capacities. In 
1951 a digital memory-storage unit that had a capacity 
of 1000 words and performed the basic memory cycle in 
about 200 /jl sec was quite sufficient to satisfy the needs 
of a large-scale data-processing system. Today memory 
units of large-scale data-processing systems are being 
designed to provide as much as 90,000 words of storage 
capacity and to perform the basic memory cycle in 
about 4 n sec. The trend is obvious, but the means of 
achieving desired results are often cumbrous. 

The development of a superior memory-storage de- 
vice is presently a major consideration of many prom- 
inent research activities. An important feature of the 
most promising schemes is the ability of the storage de- 
vice to perform a nondestructive readout. This means 
that the state of the storage device is not destroyed 
whenever a readout is performed. This is not the case 
with present-day magnetic core memories which destroy 
the stored information when the core is read, and conse- 
quently require that the information be written back 
into the memory if retention is desired. This in effect 
gives the nondestructive storage medium a 2:1 speed 
advantage over the destructive storage device. 

The read/write speeds of the memories being devel- 
oped with present-day techniques are approaching 
their maximum, and any further increases can be 
achieved only with decreased reliability and increased 
costs. These memories are capable of executing only one 
access at a time and therefore restrict the digital com- 
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puters to functioning sequentially. The ability of a 
memory to perform more than one access simultane- 
ously would be a major advancement in the computer 
field; this would be equivalent to increasing the read/ 
write speed of the memory cycle. But a much more sig- 
nificant aspect to this mode of operation is that it 
would make possible the practical realization of truly 
parallel computers, computers capable of simultane- 
ously and independently sharing the same memory or 
memories and hence able to communicate at the com- 
puter speed. A storage system which holds promise of 
fulfilling all these desirable features is a multiload trans- 
fluxor memory. 

The multiload transfluxor is a multiapertured mag- 
netic memory element employing the same type of high- 
remanent ferrite used in the ordinary memory cores. 
Thus the transfluxor is built upon a strong foundation 
of practical and theoretical ferrite core knowledge. The 
wealth of experience that has been accumulated during 
the development and use of coincident current magnetic 
core memories is applicable. In addition the transfluxor 
offers many properties heretofore unobtainable. 

The Two-Hole Transfluxor Memory 

The original Raj ch man and Lo transfluxor is a two- 
hole ferrite core. 1 The following is a brief explanation 
of the device. (See Fig. 1.) 

The large hole is used for the writing operation and 
the small hole for reading. There are two parts to the 
writing cycle: a block pulse and a set pulse. The block 
pulse is a large pulse, either positive or negative, which 
saturates the entire core in one direction. When the core 
is blocked, the flux direction on both sides of the small 

1 A complete explanation of the device may be found in J. A. 
Rajchman and A. W. Lo, “The transfluxor,” Proc. IRE, vol. 44, pp. 
321-332; March, 1956. 
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hole is the same [Fig. 1(b) ]. The set pulse is a restricted 
smaller pulse of opposite polarity which reverses a 
portion of the flux in leg one and all the flux in leg 2. 
This is the set condition of the core [Fig. 1(c)]. The 
transfluxor is read by applying sine waves or pulses 
of alternating polarity to the small hole to sense the 
presence or absence of a set condition. If the core is 
set, the following occurs. First, a prime pulse, which 
produces a clockwise path around the small hole [see 
Fig. 1(d)], is applied to leg 3. A drive pulse is then ap- 
plied to leg 3 to produce a counterclockwise flux path. 
The reversal of the direction of the flux path around the 
small hole by the prime and drive pulses generates an 
emf which is sensed by a winding on leg 3 [see Fig. 1(e) 
and 1(h)]. The set condition of the core and subsequent 
readout of a sense voltage is the equivalent of writing 
and reading a “1.” 





CORE CORE DRIVING 

(g) (h) (i) 

Fig. 1 — Transfluxor operation. 


In order to write a “0” in the core, the write opera- 
tion must be modified so that the core remains blocked. 
This is accomplished by an inhibit winding through the 
large hole. At the same time the set pulse is applied, a 
half-current pulse of opposite polarity is applied to the 
inhibit winding. This half-pulse thus nullifies the set 
pulse and the core remains blocked. With the trans- 
fluxor blocked, the prime-drive pulses will not reverse 
any flux in the vicinity of the small hole and conse- 
quently will not generate any sense voltage [see Fig. 


1 (g) — l(i) ]. Thus the generation or nongeneration of a 
sense voltage is the equivalent of reading a “1” or a “0” 
from the core. 

The applicable addressing techniques for transfluxor 
memories are similar to those used for magnetic core 
memories. Each of the two transfluxor holes may have 
its own set of selection wires as shown in Fig. 2. Be- 
cause there are separate addressing systems for both 
reading and writing, and because there is negligible in- 
teraction between aperture signals, it is possible to 
write in one location of the array while simultaneously 
reading in another location. 


HALF PRIME a HALF BLOCK a 
DRIVE SET 



Fig. 2 — Coincident current transfluxor array. 


An important feature of the transfluxor is its ability 
to perform a nondestructive readout. This ability of the 
transfluxor memory system offers significant advan- 
tages to the digital computer. Because of the nondestruc- 
tive nature of the transfluxor, there is no need to restore 
the transfluxor to its original state after interrogation. 
This results in the elimination of a major portion of the 
read/write cycle normally associated with memory 
systems, and the timing requirements are appreciably 
simplified. In effect the speed of the memory cycle is 
increased by approximately 2:1. 

This nondestructive read characteristic is also signifi- 
cant in that there are no ruinous effects when a transient 
error occurs during a read operation. True, the informa- 
tion in a memory location may be read incorrectly due 
to a transient noise but this does not affect the contents 
of that memory location since the data do not have to 
be rewritten from the output. A computer that has a 
nondestructive memory can easily cope with transient 
noise by immediately repeating the read operation upon 
detection of a read error. The ability to read from the 
memory without destroying its contents is a desirable 
feature especially in real-time computer applications 
where the execution of the stored program must be re- 
iterated indefinitely. 
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The Multiload Transfluxor 

One of the attributes of the transfluxor is signal isola- 
tion between the windings of the small hole and the 
large hole. Through the proper physical placement of 
additional small holes in the ferrite body and a separate 
set of addressing wires and sense windings in each small 
hole, it is possible to obtain more than one independent 
output from the transfluxor without seriously affecting 
the signal isolation between holes. Thus it is possible 
to consider three, four, five, or more holes in the trans- 
fluxor, offering the designer an extremely versatile mem- 
ory storage component. This new component is called 
the multiload transfluxor. 

For the purpose of discussion, consider a multiload 
transfluxor with a large hole and two small holes. If only 
one of the small holes is used for readout, the perform- 
ance of the multiload transfluxor is identical to the two- 
holed transfluxor. A possible configuration of a dual- 
load transfluxor and its flux patterns with only one of 
the small holes being pulsed for readout is shown in Fig. 
3(a)-3(d). 

The cores shown in Fig. 3 are pulsed by the coinci- 
dent-current address method. This means that two 
pulses, one on the X wire and one on the Y wire, are 
needed to supply the total current required to generate 
the proper flux condition in the core. These flux patterns 
are nearly the same as those for the more conventional 
transfluxor. If the prime-drive cycle is simultaneously 
initiated in both small holes, the resulting flux distribu- 
tions are shown in Fig. 4(a) and 4(b). Notice that there 
is minimum interaction between the holes. This permits 
the timing of the priming and driving pulses for each 
hole to be independent of the other. 

The multiload transfluxor as used in a memory sys- 
tem has a separate set of addressing wires and sense 
windings in each small hole. Each set of wires is tied to 
independent address registers and loads. All address 
registers are capable of addressing randomly any core 
in the array and any core may be addressed by all the 
registers. Therefore the information stored in one core 
may be read out simultaneously to any or all loads, and 
any number of cores may be read independently into 
different loads. 

With only one large aperture in each multiload trans- 
fluxor, there can be only one write/addressing source for 
each word. This limitation is imposed on the large aper- 
ture because each inhibit winding is common to every 
bit in its memory plane; so it can only represent one ad- 
dressing source. Of course, multiple writing could be 
made possible by eliminating the inhibit winding, but 
this scheme involves many more control circuits. A 
simple and quite satisfactory method of accomplishing 
multiple writes is to use a split-memory system. The 
split-memory system is discussed later in this paper. 

There are two techniques for selecting cores or core 
registers mentioned in this discussion: coincident-cur- 
rent selection and external word-selection (end firing). 



PRIMING BOTH HOLES DRIVING BOTH HOLES 




Fig. 4 — Flux patterns of a dual-load transfluxor with both small 
apertures being pulsed ; double-load readout. 


The coincident-current technique is illustrated in Fig. 
5. It is used here to address three cores simultaneously 
from three independent sources. 

The wire leads in the array are designated by a code 
which describes the addressing source, the addressing 
axis, and the row or column number of that axis. For 
instance, LF3 indicates that the origin of the lead is 
the third column in the Y axis of the L addressing 
source. 

In this figure the U addressing source transmits half- 
current priming signals on the UX 4 and UY3 leads. Ac- 
cordingly, the upper apertures of cores 41, 42, 44, 45, 
13, 23, and 33 all receive half-current pulses, but this 
half current is not sufficient to significantly disturb the 
flux pattern around the small aperture. This is a basic 
requirement of any coincident-current magnetic-core 
memory. However, core 43 receives half currents on 
both the UX and U Y axis leads and the coincident 
summing of these currents causes a flux reversal in the 
vicinity of the small aperture. Thus, core 43, and only 
core 43, is primed for reading by the U addressing 
source. Similarly, the L addressing source selects core 13 
for reading also, while M addressing source selects core 
22 for writing. 
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Fig. 5 — The effects of simultaneously pulsing three cores; not shown: 
inhibit wires, sense wires to loads 1 and 2. This is only one plane 
of the memory. 


In contrast to the coincident-current technique of 
selecting a word by summing the half currents of the X 
and Y axis leads, the external word-selection technique 
involves one wire that carries the full current to the se- 
lected word. A multiload transfluxor memory using the 
external word selection is shown in Fig. 6. Memory 
planes, composed of printed multiload transfluxor 
plates similar to those of a ferrite-plate memory, are 
stacked with the selection wires threaded in the word 
dimension. There are three apertures per bit, with one 
selection wire through each aperture, thus requiring only 
3 physical wires per bit. All other windings can be 
printed on the ferrite plate, since they are all in the 
X- Y plane. These printed windings consist of one con- 
tinuous inhibit winding per plate through each large 
aperture and two continuous sense windings per plate, 
one through each upper small aperture and the other 
through each lower small aperture. The X and Y ad- 
dressing signals are supplied to an external circuit using 
either magnetic or transistor switches which provides 
the gating to select a specific word. 

The advantages of this technique are as follows: 

1) There is only one selection wire required per hole 
of a transfluxor. This eases the congestion and 
reduces the task of threading wires; 

2) The noise problem due to the half-current selec- 
tion pulses is eliminated; and 

3) The specifications on the current amplitudes of 
block and drive pulses are less critical than in the 
normal array. 

The multiload transfluxor for most memory design 
purposes imposes no additional restrictions than are re- 
quired for toroidal cores. In fact, other selection meth- 
ods may prove more advantageous than the ones de- 
scribed. 



Fig. 6 — Switch-driven memory-employing transfluxors. 


LEG I 
INPUT' 


LEG 3 
INPUT' 


LEG 3 
OUTPUT 


s ' s 

i 

If 

II 

~j If 

B 

n 18 

1 i 

ii r 

u 

F 

1 

mi 

> i F 

! 
f 

1 

1 

1 

i u" 

> P 

, BASIC PUL! 
1 PATTERN 

i 

1 .1 

1 

1 1 

r\ * 


vJIUTWLUUirui UCIMIL 



0.1/i SEC/DIV 

r l = io.ooo y\. 


LEG I INPUT 

BLOCK 
-10 AT.! 

LEG 3 INPUT 


LEG 3 OUTPUT 


f SET + 2 A.T 




OUT 


SIGNAL OUTPUT DETAIL 


2 VOLTS/ 
TURN/DIV 































z 

5 






- 

z 














— 

a: 


i'i . 

i i t i i 

0 1.0 2.0 3.0 SEC 

TIMING SEQUENCE 
(r l = io,oooyv) 


0.1 fi SEC/DIV 
R l = 90 /V 


Fig. 7 — A test program for a transfluxor of the large size. 


Laboratory Test Data 

A considerable amount of experience has been gained 
by RCA in studies on the transfluxor itself and in de- 
velopment of a transfluxor memory for an airborne 
digital computer. The characteristics of transfluxors 
which make it ideally suited for high-speed digital mem- 
ories were laboratory tested. The results are described 
here briefly; some data and waveforms are shown in 
Fig. 7. 

The test setup used a large core (364-mil O.D.) which 
requires rather large currents. In practical memories 
smaller cores will be used with more reasonable cur- 
rents. A memory is now being constructed using trans- 
fluxors with an outside diameter of 200 mils. 

The currents used in these test circuits were abnor- 
mally large to accentuate certain conditions. The block 
pulse, for instance, was maintained at 10 amp turns to 
observe clearly the effect of the set pulse amplitude upon 
the output. The pulse pattern illustrated in Fig. 7 shows 
a driven output of 6 volts per turn (peak) with a pulse 
width of 0.18 nsec (at 50 per cent of pulse amplitude) 
and a switching time of 0.30 n sec. With such a large out- 
put it may be possible to eliminate the sense amplifier 
in certain applications. 
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Under laboratory conditions the prime-drive (non- 
destructive readout) cycle on a single core has been com- 
pleted within 0.8 n sec. A block-set sequence has been 
completed in 2.5 fx sec. Faster speeds appear to be possi- 
ble, but could not be observed due to limitations im- 
posed by the testing equipment. 

The nondestructive readout properties were tested by 
setting one core and blocking another. A stream of bi- 
polar (prime and drive) pulses were applied to both 
cores. After approximately 3,600,000,000 readouts, the 
outputs were the same as at the start of the test. 

One of the problems common to ferrite devices is their 
temperature dependence. In most memories this prob- 
lem is avoided by placing the entire core assembly in a 
temperature-maintained environment such as a 40°C 
oven. Typically a ±3°C range is maintained. Work has 
been done by Abbott and Suran 2 and by Bennion and 
Crane 3 on stabilizing the logical transfluxor for wide 
temperature variations, but a temperature-maintained 
oven still appears advisable for coincident current 
memories. 

Computer Memory Applications 

There are four basic paths of communication between 
the memory and other sections of a digital computer: 

1) New data are transmitted to the memory from the 
input unit; 

2) Stored data are transmitted from the memory to 
the central computer for processing; 

3) Results data are transmitted to the memory from 
the central computer; 

4) Answer data are transmitted from the memory to 
the output unit. 

In general, present-day computer memories can per- 
form only one of these functions per memory cycle. This 
means that while the memory is communicating over 
any one of these basic paths, the other three communi- 
cation paths are stymied. This inherent interference is 
a limiting factor in realizing the full operating potential 
of a digital computer. 

The introduction of a computer memory that has the 
ability to communicate over all four of these paths 
simultaneously would offer effectively higher speeds of 
operation and other advantages. Such a memory is now 
possible through the use of the multiload transfluxor. 

Conventional Computer Organization 

One suggested organization of a conventional digital 
computer utilizing a three-hole transfluxor memory is 
shown in the block diagram of Fig. 8. This memory unit 
is controlled independently from each of three sources: 
the output unit, the input unit, and the central com- 
puter. The interconnections between these three sources 

2 H. W. Abbott and J. J. Suran, “Temperature characteristics of 
the transfluxor,” IRE Trans, on Electron Devices, vol. ED-4, pp. 
113-119; April, 1957. 

3 D. R. Bennion and H. D. Crane, “Design and analysis of MAD 
transfer circuitry,” this issue, pp. 21-36. 



Fig. 8 — Computer memory organization using a 
three-hole transfluxor memory. 


and the memory unit are as follows. One of the two 
small apertures in each transfluxor is used for reading 
data from the memory to the output unit. The other 
small aperture in each core is used for reading data from 
the memory to the central computer. The large aperture 
in each transfluxor is used to write data into the memory 
unit from either the input unit or the central computer. 
Since both of these sources could simultaneously initiate 
memory-write instructions, a switching and priority 
unit must be provided to make a decision as to which 
should take precedence and control the flow of informa- 
tion accordingly. Until the execution of the selected 
write instruction is completed, the other write instruc- 
tion is temporarily held up. 

The logical operation of this proposed digital com- 
puter presents another minor restriction in that it is not 
permissible to read and write in the same memory loca- 
tion simultaneously. Two comparator checking circuits 
are required to check for coincidence of like read and 
write addresses. When like read and write addresses are 
detected, the procedure is to interrupt the performance 
of one of the instructions until the other is completed. 

Except for these two restrictions, the output unit, the 
input unit, and the central computer are able to execute 
all read and write instructions independently, simul- 
taneously, and asynchronously. They can function inde- 
pendently because each source is capable of selecting its 
own memory location regardless of the performance of 
the other source. They can function simultaneously be- 
cause there is no problem of interference when data are 
read from two memory locations and written into an- 
other memory location at the same time. In fact, there 
is no interference when both read sources simultane- 
ously select the same memory location. They can func- 
tion asynchronously because good signal isolation 
makes different timing sources permissible. 

Split Memory 

The inability of a computer to write into the memory 
from both the input unit and the central computer 
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simultaneously may be solved by another approach 
more compatible with the philosophy of parallel opera- 
tions. It is a split-memory system shown in the block 
diagram of Fig. 9. In this memory system, one portion 
stores only the data from the input unit and the re- 
maining portion stores only the central computer data. 
Thus there are effectively two separate memory units 
for writing information, but only one for reading by the 
central computer and output unit. 

An important point to emphasize here is that the 
memory can be divided into two or more sections in 
order to accommodate simultaneous inputs to the mem- 
ory from two or more sources. This division can be 
made in such a way as to satisfy the relative storage 
needs of the inputs. For instance, the memory can be 
divided so that 40 per cent of it is addressable for writ- 
ing by the computer, 30 per cent of it is addressable for 
writing by a tape input, and 30 per cent of it is ad- 
dressable for writing by a magnetic drum. It is obvious 
that there is a great number of possible arrangements 
of such a memory system. 

An advantage of this digital computer organization is 
that there is no interference between the write instruc- 
tions of the input unit and the central computer and 
thus no need for a switching and priority unit. As a re- 
sult there is a definite saving in computer operating 
time. However, this advantage is gained only at the ex- 
pense of a somewhat increased demand for memory- 
storage space and the introduction of two more com- 
parator-checking circuits. 

Parallel Operations 

Some of the possible uses of a three-hole transfluxor 
have been given above. Additional small holes in the 
multiload transfluxor further increase its versatility. 
The possibility of having many small holes for reading 
purposes permits the realization of heretofore unachiev- 
able system designs. In real-time computer systems, it 
is desirable to be able to read large quantities of data 
continuously to output devices without interfering with 
other memory communications. In a missile control 
center, for example, there is a need for high-speed 
printers, tote-board display units, CRT display units, 
and tape units. Data transmitted to the CRT display 
units must be renewed at a rate sufficiently fast to elim- 
inate flicker. The tote-board display must be able to 
exhibit pertinent information that is frequently up- 
dated to keep personnel abreast of the happenings. Con- 
currently, the tape units and high-speed printers must 
also be able to record in real time. If all of these output 
devices require access to the same data, the memory 
must consist of multiload transfluxors which have one 
small aperture for each output device. However, if the 
data required for the various output devices are differ- 
ent, the transfluxor memory unit can be split for reading 
in a manner similar to that discussed previously for 
writing. In this case, the memory would consist of three- 
hole transfluxors with one small aperture of each con- 
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Fig. 9 — Split memory system variation of Fig. 8. 


nected to the one applicable output and the other small 
aperture connected to the central computer. With 
either configuration there is complete independence of 
reading by any or all output devices. 

The development of the multiload transfluxor mem- 
ory will make it practical to design and develop central 
computers that operate on multiple addresses in par- 
allel. This means that instead of the conventional 
method, which must address each word required by the 
problem in sequence, all words may be addressed simul- 
taneously in a transfluxor memory. Some novel com- 
puter designs based on this principle follow. A com- 
puter arithmetic unit could be designed to accept both 
the augend and addend data simultaneously. Computer 
logic based on a parallel three-address instruction could 
permit simultaneous reading of two operands to the 
arithmetic unit and writing of previous adder results 
into the memory unit. It is also conceivable to develop 
a sorting unit that simultaneously accepts and sorts 
three, four, or more inputs. Such a device would be a 
welcome addition to business computers where a major 
per cent of the time is spent sorting. 

Common Memory 

It is becoming increasingly important in the fields of 
science, computer checking, and real-time applications, 
where the speed of computation exceeds that permitted 
by one high-speed computer, to devise systems which 
link two computers together in order to handle ade- 
quately complex problems. One method, which is in use 
at Holloman Air Force Base, accomplishes this by using 
a conventional memory as the communicating link be- 
tween computers. The basic operation of this type of 
system is illustrated in Fig. 10. Separate transfer mem- 
ories A and B serve as links between the two computers. 
Computer A writes into transfer memory A any data 
which are needed by Computer B. Computer B may 
then read these data, process them, and either store the 
result in its own memory or in transfer memory B from 
which computer A may then read the data out. This 
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READ-B 



READ-A 

Fig. 10 — Two-computer communication link using 
a transfer memory organization. 


two-computer system offers certain advantages in speed 
over the operation of a single computer. 

Another method of linking two computers is being 
used at the National Bureau of Standards. In this sys- 
tem one computer, called the Secondary, is a slave to 
the other computer, called the Principal. A similar or- 
ganization of units within one computer network is used 
in the IBM 709 computer. Here the Data-Synchronizing 
Unit is comparable to the Secondary, and the Central 
Processing Unit is comparable to the Principal. The 
CPU can assign the DSU an input-output task and 
then proceed with its own program. The DSU proceeds 
independently, with the exception of memory access 
which it must share with the CPU, until it completes 
its assignment. In this system the program for the DSU 
is essentially wired in and the memory is time-shared. 
The National Bureau of Standards system is shown 
functionally in Fig. 11. The Principal has the authority 
to assign part of a problem to the Secondary. However, 
for the Secondary to perform its task it must be supplied 
with assignment instructions and data. This transfer 
of data interrupts the operation of the computer A 
memory and thus is time-consuming. 

Now consider the advantages of using a “common 
multiload transfluxor memory” as the communication 
link between two computers. The functional linkage of 
such a system is shown in Fig. 12. The common memory 
is split into three sections to permit the execution of in- 
dependent writing operations by both computers and 
any input-output devices. However, the memory is not 
split for reading, so both computers and other devices 
can have independent access to any data in the common 
memory. This common memory will permit three read 
and three write operations to occur simultaneously. 

The significance of this system organization is that 
the common memory, in addition to acting as a com- 
munication link between computers, also serves as the 
memory-storage unit for each computer. As a result, 
there is no transfer time involved in communicating be- 
tween computers. This saving of time is achieved be- 
cause each computer independently has direct access to 
any data pertinent to the functioning of the other com- 
puter. 

Another significant feature of this organization is that 
the common memory will need less storage capacity 
than the total amount required by two individual com- 



Fig. 11 — Two-computer communication link using a 
Principal-Secondary computer organization. 



Fig. 12— -Two-computer communication link using a common 
multiload transfluxor memory organization. 


puters. This reduction in memory storage capacity can 
be realized because 

1) No transfer memory-storage units are required 
since each computer has direct access to any data 
pertinent to the other computer; and 

2) Basic subroutines, stored constants, and input 
data are available to both computers, since both 
have access to all the memory-storage locations. 
Thus the duplication of common data is elim- 
inated. 

The concept of a master-slave relationship can be 
carried over to a common memory system but with in- 
creased flexibility. The necessity of assigning large 
blocks of data and several tasks at once to the Sec- 
ondary in order to keep it operating continuously is 
practically eliminated. Since all data are directly avail- 
able to both computers, only the initial address of sev- 
eral stored programs needs to be assigned. In addition 
the roles of Principal and Secondary can be inter- 
changed at will so that either computer can assign a 
task to the other. 

The system organization as proposed in Fig. 12 re- 
quires a four-hole transfluxor memory unit. With a five 
or more hole transfluxor memory, it is possible to expand 
this organization to incorporate three or more com- 
puters. The more computers in the network, the more 
efficient the common memory becomes. 
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Conclusion 

The basic mechanisms of transfluxor operation have 
been shown and a few examples given on how this versa- 
tile component may be used. 

The multiload transfluxor is constructed by placing 
additional reading apertures in the core and wiring each 
hole for separate addressing. This permits many read 
operations to take place throughout the memory at the 
same time. Each readout is delivered to its own inde- 
pendent load. The nondestructive read property elim- 
inates the rewrite time associated with conventional 


core memories; this feature permits cutting the read 
time in half. 

It appears that, by utilizing these properties, consid- 
erably more flexibility and speed can be built into a 
transfluxor memory. The independent operation of the 
various parts of the memory would facilitate communi- 
cation between sections of a computer or between two 
computers and would permit all parts of a computer 
network to operate without delays due to memory time- 
sharing. This would be a major advance in computer 
design. 


Design and Analysis of MAD Transfer Circuitry* 

D. R. BENNION f and H. D. CRANEf 


I. Introduction 

T HIS is the second in a series of papers 1 concerned 
with a technique for performing combinatorial 
and sequential digital logic with magnetic ele- 
ments and connecting wires only. These elements are 
termed MAD’s (Multi- Aperture Devices). For clarity, 
in the first paper the basic techniques were described 
in terms of simple circuit structures which do not repre- 
sent the best that can be achieved in the way of opera- 
tional properties. The object of this paper is: 1) to 
present circuit techniques for significantly improving 
the circuit operation; and 2) to present experimental 
and analytic results which are pertinent to an under- 
standing of the coupling loop operation. 

The basic coupling loop and clock cycle are briefly re- 
viewed in the next section. 

II. Review of Basic Coupling Loop 
and Clock Cycle 

The circuits discussed here use only POSITIVE 
MAD elements 1 (although the results are applicable to 
circuits using other types of MAD elements) . Each ele- 
ment has at least two small apertures, one used for an 
output winding and one for an input winding. The out- 
put winding of one element connects with the input 
winding of another to form a coupling loop, and in this 
way a pair of electrically connected elements is formed. 
As information is shifted along a chain of elements, each 

* This work was carried out at the Stanford Research Inst., 
Menlo Park, Calif., under the sponsorship of the Burroughs Corp. 
Res. Center, Paoli, Pa. 

t Stanford Res. Inst., Menlo Park, Calif. 

1 H. D. Crane, “A high-speed logic system using magnetic ele- 
ments and connecting wire only,” Proc. IRE, vol. 47, pp. 63-73; 
January, 1959. 


element alternately plays the role of a receiver and 
transmitter. 

Output Aperture 

An element can be in either the Set (binary one ) or 
Clear (binary zero) state, Fig. 1. Typical 4> t -Ft (where 
Ft is the driving mmf N t It) curves for the output 
aperture of a transmitter for these two states are illus- 
trated in Fig. 1(c). If the element is in the Set state, 
Fig. 1(b), then flux changes locally about the output 
aperture in Response to small values of mmf Ft, whereas 
if the element is in the Clear state, Fig. 1(a), flux can 
change only about a path enclosing both the output and 
central apertures. Because of the longer, path length in 
the latter case, larger switching mmfs are required. 
(Subscript T indicates that this winding is connected 
with the transmitter end of a coupling loop.) 

Input Aperture 

A receiver element is always cleared to its zero state 
before transmission into it. It operates then only along 
a <Pr-Fr Clear curve (where Fr is the mmf NrIr ) 
which is essentially the same as the Clear-state curve 
for the transmitter, since the relevant path lengths are 
the same, Fig. 2(a) and 2(b). 

An important property of a system in which windings 
connect with apertures, as indicated here, is that once 
an element is Set, it is impossible to Clear it from any 
aperture winding. In Fig. 2(c), an element is shown Set, 
as a result of current Ir. In Fig. 2(d), the same element 
is shown after a subsequent “negative set” current, 
— Ir. Note that as a result of the negative set current 
flux changes only locally about the input aperture with- 
out disturbing the flux about the output aperture. 
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Fig. 1 — Output aperture properties. 




Fig. 2 — -Input aperture properties. 


Coupling Loop 

To form a coupling loop, the output winding of one 
element is connected to the input winding of the next 
(Fig. 3). If the transmitter is in the zero state, then 
transmission is trivial in the sense that the receiver is 
already in the zero (reference) state. It is necessary then 
that the Advance current I a be controlled in value so 
that in this case no flux switches in either element. With 
the transmitter in the one state, it is desirable that I R 
(and hence I a) be as large as possible to facilitate flux 
transfer. For best transmission then, it is desirable to 
make Ia as large as allowable. The largest allowable 
value I a is determined by the zero state condition, and 
is equal to 

Fz Fz 

I a™ = I T m + In m = — + — 

N t Nr 

where Fz is the Clear-state threshold mmf (Figs. 1 and 
2). In this case, assuming that the current divides into 
It = Fz/N t , and Ir = F 2 /Nr, both devices are brought 
just to their thresholds Fz, but no flux is switched. Thus 
a zero is “transmitted” as a zero. If the transmitter is in 
the Set state, however, the transmitter has a much 
lower threshold, Fi, so that IrJ>It and the Advance 
current I a causes flux switching in both the transmitter 
and the receiver. Thus the receiver is set to the one 
state. 

Clock Cycle 

A clock cycle for operating a two-MAD-per-bit shift 
register is briefly reviewed, Fig. 4. The elements are 
considered in two groups, arbitrarily labelled O (odd) 




Fig. 4 — Clock cycle. 


and E (even) . Information is stored in one group while 
the other is being cleared. The information is then 
shifted to the cleared group, while the other group is 
cleared, etc. 

To get started in the cycle, assume the data are 
initially stored in the O elements. The ADV O— »E pulse 
shifts the information to the E elements. The O ele- 
ments are then explicitly cleared by the CL O pulse. 
The ADV E— >0 pulse shifts the information back to 
the O elements, and the E elements are then cleared by 
the CL E signal, etc. Thus the basic cycle is 

• • • , ADV 0 — » E, CL O, ADV E — > 0, CL E, • • • . 

By tracing the details of the data transmission, one 
may see how the MAD elements and basic clock cycle 
work together to yield unilateral transmission. As pre- 
viously indicated, transmission of a zero is trivial. 
Transfer of a one is followed in detail in Fig. 4. The 
statements made with reference to this particular one 
transfer are true as well for every O— >E loop simultane- 
ously transferring a one. 

Flux switched in any leg as a result of a particular 
pulse is indicated by a heavy arrow. The ADV O— »E 
pulse switches flux locally about the output aperture of 
the O element and causes the E element to be set. The 
CL O pulse then clears the O element and in so doing 
switches flux through the output winding. This results 
in a loop current flow that negatively sets [see Fig. 2(d) ] 
the E element (receiver) without affecting the flux state 
about the output aperture of the E element. Note that 
neither the ADV 0-^E nor CL O pulse causes any flux 
to be switched in the output leg of the E element (in- 
dicated by the crosses opposite the output winding), 
eliminating thereby the need for the conventional series 
coupling diode. ADVANCE E^-0 shifts the binary one 
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to the next 0 element in the direction of transmission, 
and CL E then clears the E element. Note again that 
neither the ADV E— >0 nor CL E pulses cause any flux 
to be switched in the input winding of the E element, 
eliminating the need for the conventional shunt diode 
to prevent backward transmission. Thus unilateral data 
flow is achieved. 

Unity Turns Ratio Operation 

Coupling loops can be successfully operated under 
conditions where N t <Nr. The condition N t = Nr is 
particularly interesting for shift register circuits since 
the coupling loops are inherently symmetrical, and 
shifting may therefore be made to occur in either direc- 
tion by control of the clock sequencing. Furthermore, 
the special case Nt = Nr = 1 results in very simple 
mechanical assemblies. The condition N t <Nr does not 
appear to have any particular advantage, but the con- 
dition N t >Nr is important as a means for obtaining 
flux gain when bi-directional coupling loops are not re- 
quired. Operation with unity-turns-ratio, i.e., Nt = Nr, 
is discussed in Sections V and VII. 

In order to simplify the mathematical relations all 
circuit descriptions will be for symmetrical unity-turns- 
ratio coupling loops. The extension to the more general 
case N t >N r is relatively straightforward. 

Advance Current Range 

It is well known that magnetic circuits that operate 
in the region of threshold are inherently slower and less 
tolerant to clock current variations than would be 
similar circuits not so limited. It is important therefore 
to determine and to take advantage, insofar as possible, 
of all techniques for improving these allowable operating 
ranges. 

Relations for advance current range are derived be- 
low for the coupling loop circuit of Fig. 3, using a very 
simple model in which we assume 1) that the one , zero 
(j) T -F T curves have vertical steps at threshold Fi and 
Fz (Fig. 5) ; 2) that if a transmitter element is in the 
zero state, then the circuit must be limited so that as a 
result of the Advance pulse, the receiver is not brought 
over its Clear state threshold F 2 ; and 3) that if the 
transmitter is in the one state, then as a result of the 
Advance pulse, the receiver element and transmitter 
element are completely switched. For this latter condi- 
tion, the receiver must receive a net drive of at least F 2 
and the transmitter a net drive of at least F%. Although 
this model is extremely inadequate (see Sections VII 
and VIII), it is very useful for comparative estimating 
purposes. 

For the circuit of Fig. 5(a), the maximum value of Ad- 
vance current id max is determined by the zero transfer 
condition. In this case, I A max = 2(F 2 / N) where F 2 /N is 
the current required in each branch to just bring its cor- 
responding element to its Clear state mmf threshold F%. 

The minimum value of Advance current, J^ min , is de- 



(c) ( d ) 

Fig. 5 — Switching model for basic coupling loop. 


termined by the one transfer condition. The equivalent 
circuit of Fig. 5(d) can best be used for visualizing the 
relations, so that 

Fi F2 

I A mm = / r mm _j_ J R mm _ 1 . 

N N 

Therefore, the percentage range R for the Advance cur- 
rent, defined as 

{I a max - /,i min ) X 100 

R = 

/a av 


is equal to 


R - 


2 


F 2 F\ 

-3F 2 + F x _ 


x 100. 


( 1 ) 


For comparative purposes, it is interesting to consider 
the limiting case when Fi = 0. Under these conditions, 
the limiting value of range, R°, is 

R° = 67 per cent. (la) 


In the sections to follow, circuits that exhibit sig- 
nificantly greater range will be introduced and the ex- 
pressions for range determined for these may be com- 
pared with the relation derived here. 

It may be noted that in these circuits only the Ad- 
vance current range is of concern since the Clear current 
range is essentially unlimited, as long as it is above the 
minimum value required for adequate clearing of the 
elements. 


Switching Speed 

It is well known that the rate of switching of a 
“square loop” magnetic material is approximately pro- 
portional to the amount of (excess) drive, over and 
above the threshold value. Thus, two thin rings (of the 
same material) of radii r 1 and r 2 would switch at the 
same rate if driven with mmf’s in the ratio ri/r 2 . 

For the switching problem at hand, we will estimate 
the switching speed with the Advance current set in the 




to 


TABLE I 

Summary of Range and Speed Relations Derived in Text 


Figure 

Condition 

R [Range in fraction of 100 per cent ] 

F°[F for Fi = 0] 

Number of switching thresholds 
n measured at Ia rv 

/n° is value \ 
n° — 1 ( J 

Vfor Fi = 0 / 

5 


M F *~ F '\ 

2 

3F 2 + Fi 

1 



" V3F 2 + fJ 

3 

2(Fx + F 2 ) 

2 

9 

let F 2 B = F -2 + Fb 

o / FP — Fi \ 

2 

3F 2 + F.g + Fi 

1 ^ 

“ \3F 2 b + Fj 

3 

2(Fi + F 2 ) 

2 \Fb/ 


N > 4 N b 

„ I" {Ft- Fi)N + 2 FxN b -1 

2N 

r(3F 2 + Fi)N - (4 F 2 + 2F 1 )2Vb1 

2V 


~ L(3F 2 + Fi)N - (4 F 2 + 2Fx)N b \ 

32V - 42Vb 

L 2(Fi + F 2 )(2V - 2N b ) J 

2(2V — 22V b ) 

10 

N < 4 N b 

0 T F 2 N — (2 F 2 + Fi)2VB~j 

2(2V — 22V*) 

r F 2 2V + FNb “J 

2V - 22V S 

L F 2 N + FiNb j 

N 

L2(F 1 + F 2 )N b J 

2N b 


N = 4 N b 

„ r2 f 2 - Fn 

1 

r 4F 2 + Fi -j 

1 


~ l_4F 2 + Fi J 

1.2 (Fi + F 2 )J 



„ r (F 2 - Fdc - Fx)N + 2F 1 N b -| 

22V 

r (3F 2 - Fdc + Fi)2V - (4F 2 + 2Fi)Nb “I 

(F 2 - Fdc)2V 


“ L(3F 2 - 3F d0 + F 1 )N - (4F 2 - 4F dc + 2F02V/J 

32V - 42Vb 

L 2 (F x + F 2 )(2V - 22Vb) J 

2F 2 (2V - 22V b ) 



„ r {Ft + Fd c )N - (2 F 2 + FOiVs “I 

„ T (F 2 4- Fd 0 )2V — 2F 2 Nb “| 

(F 2 + Fdc)2V + Fi2Vb 

2V(F 2 + Fdc) 1 


~ L(F 2 + Fda)N + (Fi - 2 F dc ) 2 vJ 

“ L(F 2 + Fdc)2V - 2F de N B J 

2(Fj + F 2 )Nb 

2N b F 2 

11 

„ ( iN ‘ A. 

r 2F 2 — Fi -1 

2 F 2 

r 4F 2 + Fi -1 

1 


Fde ~ V N ) 2 

" L4F 2 - 2Fdc + Fi J 

2F 2 — Fdc 

L2(Fi + F 2 )J 


N = 2N b 

i — 

sC 

1 

e? 

<N 

1 

C 

2 

r 4F 2 + Fi “I 

1 


and Fdc = F 2 

“ L2F 2 + f J 

1.2 (F x + F 2 )J 

12 

Relations same as in Fig. 10 with 2V replaced by 2V' + 2 Nb' and Nb replaced by Nb' 

13 

Relations same as in Fig. 10 with N replaced by 2 (2Vi + 2V 2 ) and Nb replaced by Ni 
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middle of its calculated range, i.e., with the Advance 
current set at 

Ui av ^that is, 

With the transmitter and receiver windings directly 
in parallel, then at every instant during the switching 
process, 

d(j) T d(f> R 

N t — Nr 

dt dt 



where <j>r and <f>R are the switched fluxes in the T and R 
elements, respectively. With equal turns, Nt = Nr, the 
branch currents will divide so as to cause the same 
number of thresholds, n, in the transmitter and receiver. 
This will result in equal switching rates, i.e., <i>T = <j>R- 
Furthermore, according to the <j>-NI model of Fig. 5, 
It and Ir will be constant during the switching process 
(see Fig. 23). The number of thresholds in the trans- 
mitter is equal to n t = iV7 r av / Fi and in the receiver is 
equal to NIr^/Fz where ir av + Ir w = Ia & v - By setting 
n t = n r = n, then we find the relation 


3F 2 + Ft 

n = — • 

2 (F» + Fi) 


( 2 ) 


In the limiting case, Fi = 0, then 

n° = 1.5 thresholds. (2a) 


This value of n° could easily have been predicted by 
noting that 


/a av 


3 Fi 
2 N * 


For the ideal case assumed here, during one transfer, the 
entire Advance current flows into the receiver. Thus, 
n° = NIa**/ F 2 . 

The relations for range, for n, and for switching speed 
proportional to n— 1 derived here are listed in Table I, 
along with corresponding relations for the circuits de- 
rived below. (It should be kept in mind that speed, 1/r, 
where r is switching time, is proportional to the excess 
number of thresholds of drive and hence n — 1, since n 
was defined as the total number of thresholds.) 


Another way to improve the speed, and in general the 
range as well, is to provide as much drive as possible 
about the output aperture of the transmitter. Under the 
ideal condition of threshold Fi = 0, then it requires no 
current to switch the transmitter, and Ir — Ia during 
one transmission. This is equivalent to saying that all 
of the mmf appearing about the transmitter output 
aperture in the zero state is transferred to the receiver 
via the coupling loop during one transmission. 

Consider again the elementary coupling loop of Fig. 6 
with Ij i max applied, Fig. 6(a) and 6(b). During zero trans- 
fer, both the transmitter and receiver are stressed with a 
threshold mmf F 2 . During one transfer, at least ideally, 
the receiver becomes stressed by 2 F 2 . If appropriate 
circuitry can provide higher stresses in the transmitter 
(around the output aperture) during zero transfer, then 
the receiver stress during one transmission is correspond- 
ingly higher. Generally, this can be achieved in two 
ways, Fig. 6(c) and 6(d). In Fig. 6(c), the coupling loop 
applies an mmf of 2 F 2 , which ordinarily would tend to 
set a zero transmitter. However, this is prevented by a 
bias equal to F 2 so that the net setting mmf about the 
central aperture in the zero state is still only F 2 . Further- 
more, the bias F 2 is not strong enough to clear a set 
transmitter during one transfer. Thus, with this arrange- 
ment, a stress of 2 F 2 can be added to the receiver during 
one transfer, resulting in a total receiver stress of 3F 2 . 

In the arrangement of Fig. 6(d), a zero state stress 
of 2 F 2 is also achieved in the transmitter, but this is 
obtained by applying an extra clear direction drive of 
magnitude F 2 on the inner leg. 

Alternate schemes combining these approaches may 
be used as well, as indicated in Fig. 6(e), where k, which 
may have any value, but practically will lie in the 
range 1 < ^< 2, is an arbitrary constant. In this case, the 
net stress about the output aperture is 2 F 2 , independent 
of k. Note that the circuit of Fig. 6(c) results from k = 2, 
and the circuit of Fig. 6(d) for k — 1. 

The initial circuit arrangements that follow are 
motivated from these concepts. However, as these cir- 
cuits develop other concepts arise which lead to still 
other circuits. 

III. Schemes Utilizing Only a Single Winding 
in the Input and Output Apertures 


Motivation 


Transmitter Bias 


It is clear from the above relations for R and n that 
one way to improve the speed and advance current 
range is to make F 2 large relative to F\. This implies 
the use of a large diameter element, compared to the 
diameter of the small aperture. However, a large ele- 
ment is undesirable for many obvious reasons. It is for- 
tunate, though, that the equivalent of a large element 
can be obtained by appropriate biasing arrangements. 
But even further, the biasing arrangements can improve 
the operation even beyond what would be expected 
merely of a “larger” element. 


With a current Irt in the clear direction through a 
winding of Net turns linking the central aperture of the 
transmitter (Fig. 7), the Advance mmf NtIt must first 
overcome the mmf F BT — NrtIbt before it can switch 
flux about the central aperture of the transmitter. This 
“bias,” therefore, effectively increases the magnitude of 
the threshold F 2 by the value Fbt- The equivalent 
threshold is, therefore, F 2 = F 2 + F B t- The threshold Fi 
is not affected, however, since the bias current does not 
link the flux paths that are local about the output 
aperture. Thus, to the electrical circuit the element ap- 
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pears to be larger than it actually is. The magnitude of 
bias is limited to a value NbtIbt = F% or the bias mmf 
itself would tend to clear a Set transmitter. Thus, with 
maximum bias, F B t — F<i , the element appears to be 
twice as large in diameter, and the effective threshold 
F{ is essentially twice F 2 . 

Receiver Bias 

With the transmitter in the zero state, the manner in 
which I a initially tends to divide between transmitter 
and receiver branches depends upon the branch in- 
ductances L t and Lb, which are attributable mainly to 
the saturation permeability of the ferrite. However, the 
final division of current depends only on branch wire re- 
sistances Rt and Rr. For identical elements and with 
turns ratio Nt/Nr, the ratio of branch inductances is 
( Nt/Nr ) 2 . It is desirable to make the resistance ratio 
the same in order to eliminate transient overshoots in 
the branch currents. At the same time, it is desirable for 
Rt/Rr to be in the ratio Nt/Nr (in the case of no trans- 
mitter bias) in order for the final mmf’s applied to trans- 
mitter and receiver to be equal. The above two con- 
ditions appear to be incompatible for Nt/Nrt^I or even 
for Nt/Nr = 1 if transmitter bias is being used. How- 
ever, by application of bias N B rIbr to the receiver 
(Fig. 8), an additional term is added to the receiver mmf 
and the above conditions can both be satisfied. In this 
sense then, receiver bias serves as a free parameter for 
simultaneous satisfaction of one additional loop con- 
dition. 

For the case Nt = Nr — N, it is clear that receiver bias 
should be equal to transmitter bias for proper balance. 
At the same time, this leads to a symmetrical loop 
capable of bidirectional transmission. 

It is important to note that whereas the transmitter 
bias must be limited to, at most, the value Fi, the re- 
ceiver bias can be of arbitrary value. This is so because 
the receiver current can only cause flux switching about 
the central aperture of the receiver, and the only con- 
cern, therefore, is with the net mmf ( NrIr — N B rI B r ) 
about the central aperture. During one transmission any 
flux switching about the central aperture of the trans- 
mitter in the Clear direction is detrimental. 

Simultaneous Transmitter and Receiver Bias 

With bias F B = F bt = F B r applied, the maximum 
value of Ci max is [Fig. 9(a)] 




l T = 0 

Ir-I 


MAX, 

A 


t MAX. 2 F 2 
A " N 
ZERO TRANSFER 

(a) 


, MAX. 

A 

ONE TRANSFER 
(b) 


2 F 2 


ZERO 


(c) 



(d) (e) 

Fig. 6 — Basic biasing arrangements. 






(a) ZERO TRANSFER 


(b) ONE TRANSFER 


Fig. 9 — Circuit using separate transmitter and receiver bias. 


A max 


2 (F a + F b ) 2 FP 
N N 


where FP = F^-p F B , and from Fig. 9(b), lA min is 




F 1 F 2 d - B 

I A rain = 1 

N n 


N 


(a) ZERO TRANSFER 


(b) ONE TRANSFER 


N 


Fig. 10 — Circuit using compatible transmitter and receiver bias. 
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Therefore, the range R is 


Rsb — 


2 


" F 2 b - Ft ~ 

_3F, B + Fx_ 


X 100 


and, again for the limiting case Fi = 0, 


For values larger than this, the transmitter would tend 
to be cleared during transfer. In either case, lA min is 
( 3 ) determined from Fig. 10(b), where 

Fil . 1 

ia min = — + F 2 + N B lA mm 

N J T J R 


Rsb 0 = 67 per cent 


(3 a) or 


where the subscript SB implies that this range is for 
circuit with simple bias. The switching factor n can be 
found in the same way as for (2) ; thus, 


NI r av 
nsB — — - — 
F i 


NI r ™ - F b 

F~ 2 


and 


3F 2 + Fb + Fi 
2(F 2 + Fi) 


F b 

n SB ° =1.S-| (4a) 

2 F 2 

It is interesting to note that with bias mmf Fb ap- 
plied, the range is increased exactly as though Fi were 
replaced by a larger element of dimension F 2 B = F 2 -\- Fb. 
However, the switching speed is improved beyond just 
the simple substitution of F 2 B for F 2 in (2) . 

Thus, by the use of bias in this manner, the switching 
properties are significantly improved. This bias could be 
provided from a dc source, or from a pulse source. In 
the next section, it is indicated that still further ad- 
vantage is obtained by having the bias mmf provided 
by the Advance current itself. 

Compatible Bias 

In order to keep the number of current sources to a 
minimum and also to insure that the receiver bias oper- 
ates only during the Advance current (if, by design, it 
should be greater than F 2 ), it is desirable to have the 
Advance current itself provide the bias as indicated in 
Fig. 10. 

In this circuit arrangement, there are two conditions 
for lA max . For N>4Nb, then I A m!iX is limited by the zero 
transfer, in which case 


F 2 + F\ 

I A min 

N - N b 

The resulting relations for R C b, n CB , Rcb 0 , and ncB° 
are given in Table I, where the subscripts CB imply the 
compatible bias use. Note that Rcb and n C B are func- 
tions of N and Nb- Of course, with Nb = 0 these relations 
reduce to the same relations as given in (1) and (2). 

Maximum Rcb 0 and ncB° occur for N = 4N B in which 
case 


Rcb 0 = 100 per cent (5) 

n CB ° =2. (6) 

The increased range obtained with compatible bias 
can be explained in terms of “moving thresholds.” That 
is, with compatible bias, the effective transmitter 
threshold ( F 2 -\-NbIa ) is itself a function of Ia ■ Hence, 
as I a increases from the center of its range and, there- 
fore, tends to approach the threshold F 2 , the effective 
threshold value itself tends to increase, reducing con- 
siderably the overrunning effect. The same stabilization 
results in the case of Advance current reduction as well, 
as far as the receiver is concerned. Thus, as I a decreases 
from the center of its range, the receiver moves further 
from threshold, but the effective threshold itself is de- 
creasing. Note that with N = 4Nb, and with Ai max ap- 
plied, the circuit of Fig. 10 exactly matches the condi- 
tions of Fig. 6(c). 

Thus, significant improvement in R and n is obtained 
by the use of compatible bias. However, in practical cir- 
cuits, N is greater than Nb, and since the minimum 
value of N b is unity, single turn coupling loops cannot 
effectively be used. 


F 2 + N b Ia max 
N 


l 


+ 


F 2 + N b IA 
N 


or 


Ia 


max — 


2 Fi 

N — 2N b ' 


If /a™* were larger than this value, then the trans- 
mitter and receiver would overrun their threshold 
during zero transfer. 

For N<4Nb, then At max is limited by the one transfer, 
in which case 


I A maX = 


Ft_ 

N b 


Counter Bias 

In order to improve the stabilization and, therefore, 
increase the Advance current range still further, it is 
necessary to increase the feedback effect; i.e ., it is neces- 
sary to make the bias “move” even faster as a function 
of Advance current. This may be achieved by increasing 
the bias turns Nb relative to the coupling loop turns N. 
However, for a given value of Ia and N, as the bias 
turns are increased the transmitter bias increases, and 
soon overruns 100 per cent. This effect may be com- 
pensated for by use of a dc bias of opposite sign (i.e., 
counter bias), as indicated in Fig. 11. 

As in the previous case there are two conditions for 
/a max . For the case where Fdc> ((4N B /N) — 1)F 2 , then 
Ia max is determined from the one transfer case. 
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Ip** _ 


+ 


F 2 + N B I A max - F d0 ■ 
~N , 

F 2 + N b Ia max - Fdo ' 
N 


or 


IP 


2(F 2 - F d0 ) 
N + 2N b 


The minimum current lA min is determined for one 
transfer and is 


Ia 


min 


Fil F 2 + N B lA min ~ F dc - 
AJr N 


or 


Ia min = 


F 2 — F de T F i 
N - N b 


These combine into the relations for R d0 and n dc 
given in the table. 

For the alternate condition F d(s <(4:NB/N—l)F 2 , 
then J,t max is given simply from the one transfer case as 


/ max — 
A 


F 2 + Fdo 

N b 


The relation for lA min is the same as before. 

The range is a maximum where F do = ((4 Nb/ N) — 1) F 2 . 
Nominally, the dc bias is limited in magnitude to the 
value of F 2 . For this value of dc bias maximum range 
occurs for N/Nb = 2, and is equal to 200 per cent. The 
corresponding value of switching factor is n° = 2. Thus, 
ultimately, a 2 to 1 improvement in operating range is 
achieved. 

This improvement in operating range is obtained at 
the expense of an extra dc bias. However, no extra wind- 
ings are required since the dc current may be simply 
carried on the existing Clear windings. Furthermore, al- 
though range improvement is obtained at the expense of 
a new current source, dc currents are very simply regu- 
lated compared with pulse currents. Finally, the magni- 
tude of the dc counter-bias current may be used as a 
fine control to aid in achieving the optimum operating 
point. 


[IV. Multiple Windings in Input 
and Output Apertures 

The circuits discussed thus far have all employed only 
a single winding in the input and output apertures. By 
relaxing this restriction, significant advantages can be 
obtained. 

Drive on Inner Output Leg 

With low-turn windings in the coupling loops, the Ad- 
vance currents can become relatively large. By using the 
drive scheme indicated in Fig. 12, then for the same 
number of coupling loop turns, the advantages of com- 



Fig. 11 — Circuit using dc counter bias. 



Fig. 12 — Circuit using inner-leg drive. 



patible bias can be obtained but with significantly 
lower Advance currents. In this circuit the Advance cur- 
rent is made to link the inner leg about the output aper- 
ture of the transmitter, as well as the coupling loop it- 
self. 

The equations for this case are identical to the equa- 
tions for compatible bias if in the latter equations N is 
replaced by (iV'+2AV) and Nb by AV. Thus, for ex- 
ample, the case N f — 3, Nb'~ 1 with the present scheme 
is identical to N— 5, Nb = 1 in the earlier scheme. Thus, 
N f = 2, Nb' = 1 yields maximum range corresponding to 
the case N — 4, Nb — 1, which was shown to be optimum 
for the earlier scheme. 

Although, for a given number of coupling loop turns, 
there are more total turns in the small aperture, the ad- 
ditional drive turns may be of relatively small wire, since 
the main concern is qnly to make the coupling loop 
turns as large as possible to reduce coupling loop re- 
sistive losses. 

Floating Coupling Loop 

The circuits discussed thus far have the following two 
disadvantages: 1) with the coupling loop directly 
driven, care has to be exercised in physically connecting 
the two branch windings together so that proper ratios 
of the (parasitic) resistance and inductance are main- 
tained and 2) since the same advance current flows 
through the coupling loops and bias windings, there are 
restrictions on the combinations of turns that may be 
used. 

By allowing “floating” coupling loops, both of the 
above disadvantages are overcome. In the circuit of 
Fig. 13, the turns N, Ni, and N 2 are completely inde- 
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pendent. The case Ni = Nz is equivalent to the maxi- 
mum range case N' = 2, Nb'~ 1 in the inner leg drive 
case, which in turn is equivalent to the maximum range 
case N = 4, Nb = 1 for compatible bias. The equations 
for this case are identical to the equations for compatible 
bias if N is replaced by 2(A r i+2V r 2) and Nb is replaced 
by N\. 

Note that with N\ = N% and with J,i max applied, the 
circuit of Fig. 13 exactly matches the conditions of Fig. 
6(d), except that in this case the stress on the outer leg 
of the transmitter (and receiver) is applied directly 
from a drive winding instead of from coupling loop cur- 
rent. Current flows in the coupling loop only during one 
transfer. 



G 



(c ) 


Fig. 14 — Gain relations for bi-stable operation. 



V. Coupling Loop Flux Relations 

The principal quantities of concern in these circuits 
are flux and current. Data are stored in particular flux 
patterns, and the currents are provided to move these 
patterns about in appropriate manner. In the earlier sec- 
tions of this paper, it was implicitly assumed that the 
necessary flux relations were taken care of separately 
and that a reasonable estimate of range could be ob- 
tained by supposing that no flux switches during zero 
transfer and that complete switching occurs during one 
transfer. 

In order for stable two-level operation to exist, the 
input (or received) flux^ij at each transfer and the input 
flux <j>R at the previous transfer must be related as indi- 
cated in Fig. 14, where the gain G=<j>R/4>R f is >1 in 
the interval < pi <f>R <4>u and is < 1 in the interval 
<f>L<4>R f <d> i- Thus, the transfer operation will tend to 
increase a “low” one level of flux toward (f>u, and to de- 
crease a “high” zero level of flux toward <f>L • This is 
equivalent to saying that with such a gain relation, the 
operation stably protects against “zero build-up” and 
“one build-down.” 

With turns ratio greater than unity, i.e., N t >Nr, it 
is a straightforward matter to arrange a coupling loop 
to obtain the necessary relation between 4>r and <f>R. 
Suppose that there are no losses in the coupling loop of 
Fig. 15(a). Then by integrating the relation Nt4>t 
= Nr4>r, which must hold at every instant of the switch- 
ing period, we find the relation N T d>r = Nr4>r, where <j>T 
and (!>r are the net flux changes in the transmitter and 
receiver. Then, provided <f>T=(f>R ', 

<t>R N T 

= > 1 . 

4>r Nr 

This relation between <f>R and <f>r is illustrated in Fig. 
15(b). Notice that the linear relation holds only until 
< Pr saturates. In order finally to obtain the necessary re- 
lation of Fig. 14(b), consider the coupling loop of Fig. 
15(c) in which a “clipper” core is added. This core is ar- 
ranged to have a total flux linkage capacity of <f>c, which 
is a relatively small fraction of the saturation flux of the 
MAD’s. However, the switching threshold is much 






Fig. 15— Coupling loop arrangement for achieving 
proper gain relations. 

lower than that of the receiver, and therefore, its full 
capacity of flux will be substantially switched before the 
receiver begins switching at all. In this way, a constant 
amount of flux is subtracted from the quantity 
( Nt/Nr)4>t when ><fc; if {Nt/Nr}^ <(f>c, then <f) R = 0. 
The resulting relation between <1>r and d>T=4>R is shown 
in Fig. 15(d). Notice that this curve has the proper form 
for bistable operation (with 4>l = 0 in this case). If the 
flux clipper is cleared at the same time as the transmit- 
ter, none of the basic clock cycle operations of Fig. 4 is 
altered, and very good transfer loop operation is 
achieved. 

The use of a flux clipper along with the condition 
Nt/Nr>1 makes the gain properties of the transfer 
loop very explicit. However, a flux clipper is not ac- 
tually required. In fact, it is further demonstrated in the 
following sections that the relation Nt/Nr> 1 is also 
not required and that successful operation can be 
achieved with N t /Nr% 1. The case Nt = Nr is interest- 
ing because the transfer loop is symmetrical, and bi-di- 
rectional shifting is possible by merely reversing the se- 
quence of Clear pulses or sequence of Advance pulses in 
Fig. 4. The case N t — Nr — 1 is particularly interesting 
because of the simple assembly schemes that are made 
possible. The case Nt>Nr is useful where bi-directional 
properties are not necessary, and extra flux gain is re- 
quired. No particular advantages can be seen for the 
case Nt<Nr. 

An explicit clipper core is not required because a 
partial clipping action is performed by the parasitic re- 
sistance and inductance of the coupling loop. With the 
Advance current set in the middle of its range, neither 
the transmitter nor receiver is brought up to its thresh- 
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old level during zero transmission. Thus, during one 
transmission, a considerable amount of current must be 
steered to the receiver before any flux can be switched 
there at all. As the current begins to rise, flux linkage is 
lost (or stored) in the loop inductance L in the form of 
LIl volt-sec, where II is the loop current. Any of this 
stored flux linkage remaining in the loop after switching 
stops in the receiver (whether due to the advance pulse 
ending or to the receiver current dropping below the ef- 
fective threshold) is dissipated in the loop resistance and 
lost. Furthermore, there is an additional resistive loss of 
flux linkage in terms of fI L RLdt (where II and R L are 
loop current and resistance) during the time in which 
the receiver is switching. Most significantly, however, 
for sufficiently low levels of flux, <j> T will be absorbed en- 
tirely into Jl LRidt+LI l before I l brings the receiver up 
to threshold, and therefore 4>t will be 100 per cent lost. 
Hence the plot of (j)R vs <f>T will start from zero at some 
value of 4>t> 0, just as in Fig. 15(d). 

Unlike Fig. 15(d), the curve will now not be linear, 
but as long as the turns ratio is just great enough to 
bring the curve above the (5r—<5r' line at some higher 
value of 4>r , bistable operation will be achieved. The 
turns ratio required is not high, 6/5 being a typical ex- 
ample. In fact, a high turns ratio is quite undesirable, 
since the excess drive available for switching is reduced. 
For example, if Nt/Nr — 2, then for a simple coupling 
loop with no bias, Ito = F 2 /Nt is an upper bound on the 
current available for steering to the receiver during one 
transfer. The receiver mmf provided by this current 
would be (Nr/ NR) F 2 = (1/2) F%, whereas the corre- 
sponding figure for unity turns ratio would be F 2 . 

Before discussing the properties of unity-turns-ratio 
operation (Nt = Nr) , let us consider some basic switch- 
ing properties of magnetic cores that will be useful in 
later discussions. 

VI. Some Basic Switching Properties of 
Conventional Cores 

Consider the flux-current relations for the conven- 
tional toroid of Fig. 16(a). Assume that the B-H curve 
for the material is ideally square, Fig. 16(b). 

With very long setting pulses I s , the 4> S -F S curve is 
as indicated in Fig. 16(c), where (f> s is the amount of 
switched flux in response to a setting mmf F S = N S I S 
applied to a well-cleared core. The ratio Fb to F a is the 
same as the ratio r a to r*- (outer to inner radius). This 
curve may be automatically plotted by setting up a con- 
tinuous pattern of alternate Clear and Set currents, in 
which the Set current is made to vary in amplitude from 
cycle to cycle. By deflection of an oscilloscope beam in 
the x direction in response to current I s and in the y 
direction in response to switched flux ( = f edt) , the 
(f) s -F s curve is automatically traced. In all of the 4>-F 
curves to be considered here, (f> represents remanent flux 
(i.e., does not include the elastic or reversible com- 
ponent of flux). To plot remanent flux curves, it is only 
necessary to energize the oscilloscope beam just after 



Fig. 16 — -Core switching experiment. 


the setting current is over, by which time the elastic 
flux has been removed. To maintain the x deflection un- 
changed until such a time as the beam is energized, it 
is merely necessary to connect the Advance current 
pulse to the x deflection plates via a delay line, Fig. 
16(d). 

An interesting property to consider in relation to the 
4> S -F S curve is its dependence on the pulse width of the 
setting current. Consider a switching model in which it 
is assumed that the switching rate dB/dt in any portion 
of the material is proportional to the instantaneous ex- 
cess drive (H—Hf), where H 0 represents the threshold 
field. Although idealized, this model does result in the 
usual inverse relationship between switching time and 
excess field for the case of a thin ring of material. By 
its use, calculated (j> s -F s curves for different pulse 
widths are shown in Fig. 17(a). For very long pulse 
widths (i.e., T— >°°), the curve reduces to that shown in 
Fig. 16(c). However, for a given drive F, as T decreases, 
a smaller and smaller amount of flux is switched; hence 
the <j) s -F s curves are monotonically lowered as T de- 
creases. With this model, for pulse widths greater than 
some critical value T c , each curve has a linear region 
marked on the lower end by the mmf required to just 
saturate the inner radius in time T, and on the upper 
end by the mmf required to just start switching the ma- 
terial at the outer radius. The nonlinear connecting 
regions are approximately parabolic for relatively thin- 
walled cores having a ratio of outer to inner radii of 
about 1.3 or less. 2 

In Fig. 17(b) is shown an actual family of <j) s -F s 
curves. For later comparison, these and all later curves, 
unless otherwise stated, are taken on an experimentally 
molded MAD element (having the nominal dimensions 
indicated in Fig. 19) treated as a conventional core. By 

2 It may also be noted that these curves have the identical form as 
for the case in which a very long switching pulse is used on a core 
having the same dimensions as here but for which the slope of the 
“rising” portion of the B-H curve of the material is a variable. With 
truly vertical sides, the curve r— ►«> applies. With the sides less 
steep, as shown by dotted lines in Fig. 16(b), the family of <j> s -I s 
curves has the identical form of Fig. 17(a). 
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(a) (b) 

Fig. 17 — Calculated and measured 4> S -F S curves. 


shaping about the apertures as indicated in Fig. 19, the 
results are substantially identical with results obtained 
on actual toroids of the same material. Notice that, 
compared with the curves of Fig. 17(a), these curves do 
not radiate in so pronounced a fashion from the value 
F a , but rather are mainly translated horizontally to 
higher values of mmf as the pulse length decreases. This 
property is very important in MAD elements for rea- 
sons that will become clearer below. 

If one looks at the corresponding switching voltage 
curves for this element, it becomes apparent what is 
causing this translation of the 4> S -F S curves. This fam- 
ily of voltage curves vs time is indicated in Fig. 18, 
where the parameter is the magnitude of I s (where I s is 
a very long pulse). The curves of Fig. 18(a) are calcu- 
lated using the model dB/dt °c (FL—Hf). Notice that this 
simple model is very inadequate for predicting the front 
end of the voltage curves. This fact is understandable, 
since this model relates more to the rate of movement of 
existing domain walls. However, if we start with a well- 
cleared core in which there is a minimum of reverse 
domains (walls), then after the pulse I s is turned on, it 
takes a time for domain walls to be established. 3 This is 
reflected in the initial slope of the voltage curves. In any 
case, notice the difference in behavior of the peaks of the 
voltage curves as a function of I s . In the family of ac- 
tual curves, there is a large “peaking delay” at low 
levels of switching. With materials that exhibit signifi- 
cant peaking delay properties, the voltage is almost ex- 
actly zero before the time of peaking. It is straightfor- 
ward to convert the voltage curves of Fig. 18(b) into 
the <f>s-F s curves of Fig. 17(b) and see the reason for 
the increase in threshold of the narrow-pulse curves with 
peaking delay. 

It may be noted that peaking delay is a property of 
some materials and not others. For example, there are 
materials for which the peaks in the switching curves of 
Fig. 18(b) lie almost directly over each other. For these 
materials, the 4> S -F S curves are more like those of Fig. 
17(a). 

Given a core of appropriate material, it is further 
necessary, in order for the core to exhibit peaking de- 
lay, that the setting current I s be applied to a well- 
cleared core. Generally speaking, Clear strengths of at 

* N. Menyuk and J. B. Goodenough, “Magnetic materials for 
digital computers, I. A theory of flux reversal in polycrystalline 
ferromagnetics,” J. Appl. Phys., vol. 6, pp. 8-18; January, 1955. 
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Fig. 18 — -Calculated and measured d(j> s /dt - time curves. 
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Fig. 19 — Shaping of MAD elements. 


least two to three times threshold are required for good 
peaking delay. 

If a MAD is made by cutting apertures in the wall of 
a conventional toroid, Fig. 19(a), then regardless of the 
Clear magnitude, it is impossible to get all material on a 
major loop since the cross-sectional area I 1 + I 2 is less 
than 1 3 for a core of unit height. Thus, even for ma- 
terial that is potentially capable of exhibiting peaking 
delay, this type of construction nullifies the effect. How- 
ever, by appropriate shaping of the element, e.g., so 
that I 1 + I 2 — G, the element treated as a simple toroid 
will exhibit significant peaking delay. 

Another very important switching property related 
to peaking delay is shown by the families of 4> S -F S 
curves taken for the condition in which the core is pre- 
set before I s is applied. The families of <f> s -F s curves 
shown in Fig. 20, contain the magnitude of preset flux 
f>p as the parameter. The difference between the various 
families of curves are that they are taken for different 
combinations of long and short duration preset and set 
pulses. 

For zero preset (4> P — 0), the total switched flux (f> s 
is <f) s ma ‘ x =4> M , where <f> M represents the total flux capacity 
of the core from saturation in one direction to saturation 
in the other. When a core has been preset, the current I s 
has correspondingly less flux available to switch. Thus 
in Fig. 20(b), as the amount of preset flux <f p increases, 
the maximum switchable flux<£ s max —4> M — 4> p . For tracing 
these curves automatically, a repeating cycle of Clear, 
Preset, and Set currents is applied to the core of Fig. 
20(a). 

For each curve, the Preset current is adjusted to 
the appropriate level and the Set current is made to vary 
from cycle to cycle. The oscilloscope plotting is exactly 
as indicated in connection with Fig. 16(d), where the 
beam is turned on just after the Set current is over. 
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<0 (9) <h) 

Fig. 20 — <t> s -F e and d<f> s /dt- time curves for various pulse 
lengths of I p and J s . 

For later convenience in dealing with families of 
curves for MAD elements, the curves of Fig. 20(b) are 
redrawn in Fig. 20(c) with the final rather than initial 
values of flux superimposed. Actually, this is equivalent 
to raising the zero flux level for each curve by the mag- 
nitude <fi p . 

When a well-cleared core is preset to a certain level 
of flux by a long current pulse, one can visualize the flux 
condition in the core to be represented by a circum- 
ferential domain wall outside of which the flux is in a 
clockwise (Clear) direction and inside in a counterclock- 
wise direction. Except in the transition region, sub- 
stantially all material is well saturated. Since I s stresses 
the core in the same direction as I p , it is reasonable for 
gentle preset (long pulse) that, regardless of preset 
level, the current I s should continue the switching where 
I p left off, and with essentially the same threshold, as 
demonstrated in Fig. 20(c). Let us now consider the ef- 
fect of shortening the pulse durations. In order for a 
certain magnitude of flux, <f> p , to be preset as the pre- 
set pulse is decreased in length, the magnitude of the 
preset current must be increased. For significant de- 
crease in duration, the magnitude of current must be 
likewise considerably increased. For a reasonably thin- 
walled core, this increased magnitude of current is ca- 
pable of switching flux simultaneously throughout the 
entire core, so that the current pulse must be shut off 
when the proper level of preset flux is reached. In this 
case, it is certain that reverse domains are distributed 
throughout the body of the core in some random fashion. 
Thus it is hardly surprising that after such a preset 
pulse, the set current, which tends to continue the 
switching in the same direction, finds a much lower and 
less abrupt threshold. This effect is clearly seen when the 
curves of Fig. 20(c) and 20(d) are compared. 


Let us next consider the effect of short set and pre- 
set pulses, Fig. 20(e). For <f> p — 0, the 4> S -I S curve is just 
the appropriate curve of the family of Fig. 17(b), for the 
given pulse duration. If we compare the curves of Fig. 
20(d) and Fig. 20(e), we notice that in the latter case, 
the threshold moves a considerable distance to the right 
for zero preset, but not quite so far for nonzero pre- 
set. This is reasonable in terms of the previous discus- 
sion of the effects of a good Clear state on peaking delay. 
Good peaking delay occurs only when all of the ma- 
terial is in a well saturated condition. Flowever, due to 
presetting with a short pulse, a random distribution of 
reverse domains is left throughout the core, resulting in 
very poor peaking delay after preset and hence very 
little increase in threshold for a short-pulse set. This 
point is demonstrated by the voltage-time curves (sim- 
ilar to those of Fig. 18) taken after a preset condition 
of <5 P = 1/2 4> M using a short preset pulse, Fig. 20(f), 
and a long preset pulse, Fig. 20(g). Notice the signifi- 
cant difference in switching times for these two families. 
The effect of preset is also demonstrated in the <j b s — F s 
curves, Fig. 20(h), taken for the same preset level 
1/2 4> M . The group of curves radiating from the lower 
threshold value of F s is for the short preset pulse; the 
other group is for the long preset pulse. The lowering of 
threshold for short preset pulses is clearly seen. Within 
each group, the parameter is the duration of Set pulse. 

VII. Switching Properties of MAD’s 

Ideal Family of Output Curves 

In Fig. 1, output (f> T -F T curves were shown for the 
two cases of a Set and Clear MAD. Actually, there ex- 
ists a whole family of such curves with the amount of 
preset (or input) flux as the parameter (Fig. 21). In 
Fig. 21(a), the input current is shown linking leg li 
about the input aperture, and the output current is 
shown linking leg 1 4 about the output aperture. Assume 
all legs are of equal dimension. Let <f> M represent the 
total flux capacity in any leg from saturation in one 
direction to saturation in the other direction. With leg 
1 4 saturated in the Clear (clockwise) direction, applica- 
tion of It of sufficient magnitude will switch an amount 
of flux <f> M in leg 1 4 independent of the amount of preset 
flux. A portion of it equal to <j> p ( =<f>i a ) will switch locally 
about the output aperture and the remainder will 
switch around the main aperture. If all material is op- 
erating on an ideal rectangular hysteresis loop, the fam- 
ily of curves will have the form indicated in Fig. 21(b). 

Actual Family of Output Curves 

Actual families of output curves, for the same MAD 
used for the above tests of core switching properties, are 
shown in Fig. 22. These curves are automatically plotted 
by the method previously described and are taken for 
the same combinations of long and short current pulses 
indicated in Fig. 20. Notice that the effects are sub- 
stantially the same as observed in the case of presetting 
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Fig. 22 — Actual f 3 cslies of 4>t-Ft curves for various input 
and output current pulse durations. 



It 

Fig. 23 — Transmitter and receiver currents during one transfer. 


into unequal branch currents It, Ir such that at all in- 
stants of time (j>T = <!>R ■ Because of the lower threshold 
(neglecting the voltage drop in the loop resistance and 
inductance) in the transmitter, Ir>I t , but always 
IrA-It = I a- This situation can be characterized by a 
loop current I l superimposed on the zero transfer cur- 
rents so that Ir — Ir 0 A~I l and I T = It 0 — I l 

Ia\ Ir — It 

where Ir 0 = It q = ~J, or I L = 

During one transfer, the transmitter may be charac- 
terized as a relatively small, thick- walled core [see e.g., 
Figs. 9(b) and 10(b) ] compared with the receiver. Thus, 
for given magnitudes of It and Ir [assuming the switch- 
ing model dB/dt* (H—Ho)], the ratio of switching 
rates at the outer and inner walls is much higher in the 
transmitter than in the receiver. For constant It and 
Ir, the rates of flux change Qt and (j>R will be constant 
in time only as long as no material saturates. Clearly, 
the inner wall of the transmitter will saturate first. 
When this happens, <Pt tends to decrease, since less ma- 
terial participates in the switching. However, this 
tendency is counteracted by an increase of It to bolster 
up 4>t at the expense of 4>r ( i.e ., Ir decreases), so that at 
all times the relation <pT — <i>R is maintained. Thus, the 
resulting I T and Ir curves have the general form indi- 
cated in Fig. 23. 

The main significance of the current relations indi- 
cated here is that during one transfer, the transmitter 
current is initially low but rises toward I a/ 2 while 
transfer continues. 


a toroid. Actually, the short-short current pulse com- 
bination is the one of interest, because this more nearly 
approximates actual operation. In Sections III and IV, 
it was indicated that in biased circuits, the transmitter 
and receiver are switched with two or more thresholds of 
current. This operation corresponds to high-drive, short- 
pulse conditions of measurement. However, it is shown 
below that for one transfer, the transmitter current It 
and receiver current Ir are not constant in time. To 
this extent, the above experimental families of curves, 
which are plotted for rectangular input and output cur- 
rent pulses, do not apply. Nevertheless, they are ex- 
tremely indicative of the nature of the operation. 

The main significance of these curves is the lowering 
of the main aperture threshold for partial set levels rela- 
tive to the Clear State threshold. It is demonstrated 
below that this property provides a mechanism for ob- 
taining proper gain relations for unity-turns-ratio opera- 
tion. 

Transmitter and Receiver Currents 

In the (ideal) zero transfer case, assuming unity turns 
ratio, the Advance current I a divides into equal branch 
currents I To = Ir q — ( 1/2) I a , where the sub “0” stands 
for the zero case. 

During a one transfer, the Advance current divides 


The (f>* Contribution to Flux Gain 

Flux gain G from one stage to the next may be de- 
fined as 4>r/4>r where <Pr is the flux received in the 
transmitter during the previous Advance pulse. If dur- 
ing the present Advance pulse, all available flux about 
the output aperture is switched but none is switched 
about the main aperture of the transmitter, then 
=<Pr. Now <f>R = <f>T — 0ioss> for a single-turn coupling loop 
(a special case of unity-turns-ratio operation) where 
</>ioss, the flux loss in the loop resistance Rl, is fl^Rcdt 
volt-sec. Hence, if 4>t = <1>r , 


Note that G>1 is impossible here because of the sub- 
tractive loss term. This equation is characteristic of the 
operation of a conventional core-diode-type shift reg- 
ister, which requires Nt>Nr in order to obtain G> 1. 

In a MAD arrangement, <Pt can be more than cpR be- 
cause of the possibility of flux switching not only locally 
about the output aperture, but also around the main 
aperture in a direction to increase the setting of the ele- 
ment. For illustration, assume that the Advance cur- 
rent is set in the center of its range, or I a = 2h av > so that 
NIa/2 is below threshold F 2 as indicated in Fig. 24. Also 
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assume a level of input flux c^r' as indicated in the fig- 
ure. Then as It increases from its initial low level (see 
previous section) towards its steady-state value /a/2, 
it soon enters a region in which flux may be switched 
about the main aperture. This flux is defined as <j>*. Thus 
<Pt = and the gain equation is modified to read 


G = 


4>r 



It is clear from Fig. 24 that <f>* is a function of 4>r' hav- 
ing the form indicated in Fig. 25. <f>* is very small at low 
levels of received flux because the maximum value of 
NIt is below threshold for switching around the main 
aperture. <p* is also very small for large <f>R' because of 
saturation. It is important to note that because of the 
finite slope of the 4>t-Ft curve below threshold (even 
for the Clear state curve) that <£* is everywhere >0, 
and hence that 00 as Thus the con- 

tribution of the term <5*/<}>r to the gain equation has the 
form indicated in Fig. 25(b). The steeply rising portion 
of this curve for low values of <t>R contributes to the 
lower unity-gain crossing of the Gain-$j ?' curve at 
<I>r'> 0, resulting in </>£>(), as would be expected. Thus, 
if over some interval, the term is smaller in magni- 
tude than the (j > * term (for (5r><5l), then a gain curve 
of the form indicated in Fig. 14 can be obtained, and the 
interval in question is just the interval (f>i<4>R <4>u 
where G> 1. 


Resistive Flux Loss 

The resistive flux loss, // lRi/U, is just proportional to 
the cross-hatched area in Fig. 23. An accurate analysis 
of 0ioss is extremely difficult, since no accurate switch- 
ing model is available for predicting the current shape 
/l( 0 as a function of flux level, 4>r'. 

In Sections II to IV, for purposes of comparing 
switching speeds for various circuit arrangements, it 
was assumed that I a divides into It and Ir in such a 
way as to apply equal multiples of threshold mmf to 
transmitter and receiver. This model does result in the 
correct ordering of the different circuits in terms of 
speed, but it is very inadequate for predicting <f>\ oss /4>R 
as a function of 4>r' for a given circuit. This fact should 
be quite obvious just from the complexity of measured 
switching characteristics as represented in Figs. 20 and 
22. 

True, one can definitely say that <j> i oss increases with 
4>r', but not even this much can be said about the ratio 
0ioss/$tf', except for very low values of <f>R f . In this latter 
case, as was indicated earlier in the discussion of the 
partial clipping action of loop inductance and resistance, 
0ioss /4>r' will be essentially unity for sufficiently low flux 
levels. 

In effect, then, all that can be said at present is that 
the loss term will decrease from near unity for very low 
4>r' to a value A at some low (j>R f ( = 4> i) and remain below 



Fig. 24 — Demonstration of <j>* flux gain. 



Fig. 25 — Plot <t>* / 4>r' -< t>R and ^Iosb/W-^V. 



Fig. 26 — Addition of gain equation components 
to form the gain curve. 


A at least up to some high <£*/(=<£ 2 ), as indicated in 
Fig. 25(c). 

In Fig. 26, it is shown how the three terms of the 
gain equation 

& 01oss 

G = 1 + — 

4>r' 4>r' 

may add to give the required form indicated in Fig. 14. 
The loss term is added as though equal to A in the inde- 
terminate interval (f>i <4 >r <<5i. 

Whatever the actual variation of the loss term in this 
interval, then, the qualitative nature of the gain curve 
will not be changed, rather only the locations of the 
unity gain points 4>i and 4>u- 

Flux Boost 

In order to get reasonable Advance current range, 
the short-short (fo-Fr curves [Fig. 22(c)] should have 
the Clear state curve moved as far to the right as possi- 
ble, and the higher level flux input curves moved as far 
to the left as possible. Experimental results have indi- 
cated that these conditions are generally met best by 
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materials exhibiting good peaking delay properties 
[Fig. 18(b)], 

The range analysis of Sections II to IV does not apply 
well to the unity-turns ratio case since the advance cur- 
rent is more limited by the requirement of getting 0* 
to make up for flux losses than by the assumptions of 
Section II. 

In any case, the situation can be significantly im- 
proved by the circuit described here. 

Between the time that the flux 0^ is received, and 
flux 4>t is transmitted to the next stage, the previous ele- 
ment is cleared. By having this pulse do double duty, as 
indicated in Fig. 27, the flux boost may be obtained (at 
Clear time) before transfer out of the element occurs. 
With N turns on the receiver, the Clear current is ad- 
justed so that the receiver is brought up to the vicinity 
of threshold F 2 . Thus, if 0 k is low (as for a zero), then 
the boost pulse has essentially no effect. If 0# is high (as 
for a one ) but less than 0 M because of 0i OS s during trans- 
mission, then the boost pulse will increase the set level 
of the element. 



CLEAR 


Fig. 27 — Circuit for achieving flux boost. 

The family of curves of Fig. 24 is taken with a winding 
about the output aperture. However, the reduced 
thresholds for partially set levels are characteristic of 
the state of the entire element (as previously described) 
and would also be observed in 0-F curves taken on the 
flux boost winding. Thus the effect of the flux boost cur- 
rent can be predicted from the0r-FV curves. 

The Clear winding has nN turns, which results in the 
transmitter being cleared with n thresholds of mmf. For 
good clearing, n should be greater than 2. By consulting 
Fig. 4, we see that with this new arrangement, two proc- 
esses are going on simultaneously in the receiver. Clear- 
ing the transmitter causes a negative set current to 
switch flux locally about the input aperture of the re- 
ceiver, and the mmf applied by the flux boost winding 
causes additional set flux to be switched about the main 
aperture. 

In all previous circuits, the Clear pulse had unlimited 
range as long as it was above some minimum. Such is 
not the case in the present circuit, since the Clear cur- 
rent works against a threshold. However, although the 
Clear current range is decreased, the Advance range is 
significantly increased. 

The flux boost method of making up flux losses has 
three main advantages compared to the 0* method. 


First, there are no coupling-loop losses associated with 
flux boost, whereas part of the available 0* is always 
lost during transfer. In fact, with flux boost, it is possible 
for the receiver to be set fully before flux is transferred 
out of it; this condition is impossible to obtain for unity- 
turns-ratio operation without flux boost, because of 
losses. Second, the boost current has a fixed magnitude 
and duration independent of flux level, whereas the cur- 
rent It switching 0* is greater (at least in integrated 
value) for low flux levels than for high levels. Third, the 
boost current can be adjusted in width and amplitude 
independently of the advance current, while the mmf 
that switches 0* is tied to the advance current. 

It is pertinent to note that with flux boost taking care 
of the flux-gain requirement, Advance range is more 
closely predicted by the relations of Sections III and 
IV. Furthermore, flux boost may be used advanta- 
geously in connection with all of the circuits derived in 
that section. This is so in particular for unity- turns-ratio 
operation, but advantage can be obtained even for cir- 
cuits in which Nt / Nr > 1 . 

VIII. Results and Conclusion 

The range and speed relations derived in Sections 
III and IV, although based on a very simple model, do 
properly predict comparative results for the various 
circuit arrangements. As pointed out in the discussion 
on flux boost, these relations do not apply well for unity- 
turns-ratio circuits if flux boost is not used. In this case, 
the Advance currents must be adjusted more to satisfy 
the flux gain requirements than the simple switching 
model used to derive these range and speed relations. 
Listed below is an example of the types of comparative 
results obtained with a coupling loop using N T = 6, 
Nr = 5, for the circuits indicated. 


Circuit 
of Fig. 

Bias turns 
Nbt = Nbr 

Range 

5 

— 

15 per cent 

10 

1 

30 per cent 

11 

2 

50 per cent, using 50 per cent 



dc counter bias 


Flux boost helps even where N t >Nr. For example, 
when flux boost is used, the 15 per cent range obtained 
using no bias will increase to about 30 per cent. In flux 
boost circuits in general, the Clear and Advance cur- 
rents may be independently adjusted to give either one 
a high range at the expense of the other. The range value 
given here, however, implies that the Clear and Advance 
currents are adjusted so that they both have the same 
range, namely 30 per cent. 

Unity-turns-ratio coupling loops, i.e., with Nt = Nr, 
have operated with the following typical results. With 
the bias circuit of Fig. 10, N = 4, N B = 1 , and flux boost, 
Clear and Advance ranges of greater than 40 per cent 
each are achieved. These results are obtained with 1-jusec 
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Clear and Advance pulses driving a register of experi- 
mentally molded elements having the dimensions indi- 
cated in Fig. 19 and made with material having a long- 
pulse threshold of 0.7 oersted. Single- turn coupling-loop 
circuits with the same effective bias operate equally 
well. 

The detailed analysis of these circuits is difficult not 
only because of the usual difficulties of dealing with the 
dynamic properties of highly nonlinear elements, but 
also because of the relatively complex geometries in- 
volved. It is clear that a good deal of the design of these 
circuits is necessarily based on intuition and empirical 
results. The circuits described here can be made to op- 
erate quite well, however, and the lack of analytical 


tools is felt more in trying to decide how or when a par- 
ticular arrangement is optimum. It is hoped that future 
efforts will result in the development of satisfactory 
switching models that will make the circuit design pro- 
cedure routine. 

The techniques presented here provide the potential 
for developing extremely reliable digital circuitry at 
least for the intermediate computer speed ranges of 0.1 
me to 1 me clock (or bit) rates. 
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A Twistor Matrix Memory for Semipermanent 

Information* 

DUNCAN H. LOONEY f 


Introduction 

A NEW magnetic matrix memory has been devel- 
oped for the storage of semipermanent digital in- 
formation. The memory is designed for computers 
which require random access to stored information that 
is changed very infrequently. The information is stored 
in a pattern of permanent magnets arranged on a plastic 
board. The presence or absence of a permanent magnet 
is sensed nondestructively by a wire wrapped with a 
magnetic tape placed close to the permanent magnet. 
A stored word is read by a linear selection system using 
a biased core access switch. 1 

The memory is fabricated in modules. A typical 
module is shown in Fig. 1. The photograph shows a 512- 
word memory consisting of 32 X 16 word locations. Each 
word location stores 26 bits of information. Any word 
location in the memory may be selected at random and 
the information read in a period of a few microseconds. 
The temperature range of operation of the memory is 
extremely wide. 

The concept of storing information in an array of 
permanent magnets was advanced by the late S. Shack- 
ell. Mr. Shackell’s work was interrupted by his untime- 
ly death and has not been previously reported in the 

* The work reported in this paper was done for the U. S. Dept, of 
Defense under Contract DA-30-069-ORD-1955. 
t Bell Telephone Labs., Murray Hill, N. J. 

1 J. A. Rajchman, “A myriabit magnetic core matrix memory,” 
Proc. IRE, vol. 41, pp. 1407-1421; October, 1953. 


literature. With the development of the twistor, 2 John 
Janik, who was familiar with the Shackell scheme, sug- 
gested its use in such a system to reduce the size of the 
permanent magnets. 

The operation of a store using the 512-word memory 
module is described in a companion paper. 3 The store, 
which utilizes all solid-state circuitry, is compared to 
other systems using photographic or magnetic tech- 
niques which can be used for the storage of semiper- 
manent information. 

Operating Principle 

The information is stored in an array of small perma- 
nent magnets. The presence of the magnet is sensed by 
a wire wrapped with magnetic tape placed close to the 
magnets. A group of 26 wrapped wires are encapsulated 
in a plastic tape. The encapsulated wires are then en- 
closed in a set of copper solenoids as illustrated in 
Fig. 2. A particular solenoid corresponding to a stored 
word may be selected by activating one core of the 
biased core access switch. The bar magnets are ar- 
ranged in a pattern on the surface of a thin plastic card. 
Each magnet is located at the intersection of a wrapped 
wire or twistor and a solenoid. The purpose of the 
permanent magnet is to inhibit locally the drive field of 

2 A. H. Bobeck, “A new storage element suitable for large sized 
memory arrays — -the twistor,” Bell Sys. Tech. J., vol. 36, pp. 1319— 
1340; November, 1957. 

3 J. J. DeBuske, J. Janik, and B. H. Simons, “A card changeable 
nondestructive readout twistor store,” this issue, pp. 41-46. 
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Fig. 1 — A 512-word memory module. The unit is composed of 16 
frames of 32 words. The cores of the access switch and the multi- 
turn windings are shown. The encapsulated twistor tape threads 
through the module continuously. 



Fig. 2 — A section of a memory frame. Three cores of the biased core 
switch and their word solenoids are shown. The absence of a bar 
magnet is sensed by the flux reversal of the wrapped wire when a 
drive current flows in the solenoid. 

the solenoid. Thus, the permanent magnet prevents the 
switching of the wrapped wire located beneath the 
permanent magnet. The magnitude of the field in the 
solenoid can be high to achieve fast switching. The field 
of the bar magnet must, however, be sufficient to in- 
hibit the switching of the wrapped wire. A plane of the 
module with a complete magnet board is shown in 
Fig. 3. 

In the present design, the solenoids are 1/16 inch 
wide and spaced 3/16 inch apart. For a current drive of 
1.8 amp in the solenoid a switching speed of about 1 /jl sec 



Fig. 3 — A memory frame of 32 words and its magnet board. Every 
other copper strip is used as a solenoid and is attached to an access 
core. The guide pins of the frame and the corresponding holes of 
the magnet board are necessary to register the bar magnets at the 
intersections of the solenoids and the wrapped wires. 

is obtained. Current pulses of 600 ma each are applied to 
four turn X and Y windings of the biased core switch. 
The bias on each core is 2.4 amp-turns. The X pulse is 
applied first since the X winding is parallel to twistor 
wires and results in a larger inductive signal. The se- 
quence of operations is shown in Fig. 4. Time is meas- 
ured from left to right. The two current pulses are 
shown in Fig. 4(a). When the applied pulses are re- 
moved, the bias current resets the core. The back volt- 
ages on the X winding through 32 cores and the Y wind- 
ing through 16 cores are shown in Fig. 4(b). The core 
requires about 0.6 amp-turns to generate an output 
voltage which will drive 1.8 amp through the solenoid. 
The resultant output signals are shown in Fig. 4(c). 
Both one and zero output signals are shown by inserting 
and removing a magnet board. The one signals average 
8 mv into 180 while the zero signals are less than 2 
mv. The signal -to-noise ratio is 5 to 1. Considering the 
time necessary to turn on the access switch, a 5-/zsec 
cycle time is easily achieved. Since there are no funda- 
mental limitations on the magnitude of the drive or the 
amount of flux which must be reversed, shorter cycle 
periods are possible. i 

The general performance of the module is shown in 
Fig. 5. The two current pulses into the biased core 
switch are shown in Fig. 5(a). The one signals observed 
on one sensing wire from 32 word locations are shown 
superimposed in Fig. 5(b). The one signals observed on 
26 sensing wires at one word location are shown in Fig. 
5(c). The open circuit and matched load output signals 
are shown in Fig. 5(d) for the 512-word module. The re- 
sistive load is 22 0 and is equal to the sensing wire re- 
sistance. 
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(c) 

Fig. 4— Electrical characteristics of the module. Time is measured 
from left to right and each division is 0.5 n sec. (a) Drive currents. 
The upper trace represents the X selection current, the lower 
trace the Y selection current. Each division is 500 ma. (b) Observed 
back voltages. The upper trace is the back EMF on the 4-turn wind- 
ing through the 16 cores of the Y winding, the lower trace that of 
the 32 cores on the X winding. Each division is 5.0 volts, (c) One 
and zero output signals observed with the magnet board present 
and removed. The first low peaks are due to inductive pickup and 
shuttle. Each major division is 5 mv. The output load is 180 0. 


Design of the Permanent Magnet Array 

A number of factors must be considered in the design 
of the permanent magnet array. First, the array of 
magnets must be simple to manufacture. Since the mag- 
nets are the primary storage medium of the memory, 
they must be capable of retaining their magnetization 
under very severe conditions. Methods must be devised 
to register the permanent magnets accurately over the 
bit locations defined by the wrapped w*ires and the word 
solenoids. Finally, the spacing between the individual 
magnets must be chosen carefully, since one magnet 
should not disturb either the neighboring magnet or its 
sensing wire. 

For illustrative purposes, the bar magnet will be con- 
sidered as two magnetic charges + m spaced a distance 
d apart. The magnetic field H at any distance r per- 
pendicular to the center of the magnet is: 

md 

H = — (1) 

[(d/2) 2 + r 2 ] 3/2 






(d) 

Fig. 5 — Output signals of the module. Time is measured from left to 
right and each division is 0.5 nsec, (a) Drive currents. The upper 
trace is the X selection current, the lower trace the Y current. 
Each major division is 500 ma. (b) 32 “one” signals observed on 
one sensing wire from 32 cores on one frame. Each major division 
is 5.0 mv. The load is 180 ft. (c) 26 “one” signals observed on the 
26 sensing wires in one word location. Each major division is 
5.0 mv. The output resistance is 18012. (d) The output signals for 
open circuit and matched output loads. The output resistance of 
the lower trace is 22 ft and is equal to the resistance of the sense 
wire. Each major division is 5.0 mv. 


(1) reduces to: 


Ho 


8 m 
d 2 


( 2 ) 


The field which acts on a sensing wire placed near the 
magnet is, then, inversely proportional to the square of 
the length of the magnet. For values of r—> oo , (1) re- 
duces to: 


and is directed parallel to the axis of the magnet. There 
are two limits to be considered. In the limit of r— >0, 


md 



( 3 ) 
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Thus, for neighboring positions, the magnetic field is 
proportional to the length of the magnet. 

Ideally, a permanent magnet should have a large ef- 
fect on the sensing element just underneath it and no ef- 
fect on any of the adjacent elements. The ratio of the 
field for r small to the field for r large should be very 
high. Using the two previous approximations, the ratio 
is: 


In order to reduce the interaction, the magnets should 
be made as small as possible. A number of other factors, 
however, prevent the magnet from being made ex- 
tremely small. The first is that the length of wrapped 
wire underneath the bar magnet must be large enough 
to produce a detectable output signal. In addition, the 
demagnetizing factors associated both with the wrapped 
wire and with the permanent magnet must be consid- 
ered. The demagnetizing field of the permanent magnet 
is inversely proportional to the square of the length d of 
the magnet. If d is reduced until the demagnetizing 
field is greater than the coercive force, the effective pole 
strength m will be reduced. In the case of the wire used 
as a sensing element, the demagnetizing field of the flux 
reversed must be less than the applied driving field. 

Boards containing the permanent magnet arrays are 
prepared by etching sheets of Vicalloy I which have 
been bonded to plastic boards. Vicalloy tape can be ob- 
tained in strips about 4 inches wide and 2 mils thick. 
The Vicalloy is heat treated to produce a saturation 
magnetization of 5000 gauss and a coercive field of 200 
oersteds. The individual magnets are etched using the 
standard photo resist etched wire technique. Master 
negatives are prepared such that the individual magnets 
may be removed by masking out their positions on the 
negative. Consequently, all information patterns may 
be prepared from one master negative. The flat form of 
the magnet simplifies its positioning over the bit loca- 
tion. The bar magnet used is 20X60X2 mils. The direc- 
tion of magnetization is parallel to the long dimension. 
The card containing the magnets is placed in registra- 
tion by guide pins and is pressed firmly against the 
solenoids by springs. Consequently, the separation of 
the permanent magnet from the sensing wire is only a 
few mils. The two-pole approximation cannot be used to 
determine the true magnetic field for distances compar- 
able to the bar magnet length. Experimentally, the 
magnetic field on the wire beneath the permanent mag- 
net is about 20 oersteds. The field on the nearest 
neighbor is about 1 oersted. 

The magnets were spaced unequally in the three 
dimensions of the present design. In order to reduce the 
inductance of the solenoid strip encompassing the sens- 
ing wires, the wires are spaced 100 mils apart which was 
considered the minimum distance to avoid lateral inter- 
actions. To prevent the permanent magnets of one word 


from acting too strongly upon the sensing wires in the 
next word, the individual solenoids are separated 3/16 
inch. This distance should be kept small to minimize 
the length of line over which the output signal must 
travel before reaching the detecting amplifier. Finally, 
the individual frames are spaced about | inch center to 
center. One quarter of an inch of this spacing is used for 
the solenoids, the sensing wires, and the supporting 
board. The remaining \ inch is taken up by the perma- 
nent magnet card and its spring assembly. Thus it is 
quite easy to slip the card in and out as is shown in Fig. 6. 



Fig. 6 — Rear view of the 512-word memory module. The magnet 
board is being inserted. It is held against the memory frame by a 
spring. 

The Wrapped Wire Used for Sensing 

The twistor wire is shown in Fig. 7. A three-mil copper 
wire has a magnetic tape wrapped around it at an angle 
of about 45°. The particular material used is 4-79 per- 
malloy. The tape has a coercive force of about 3 oersteds 
and a cross section of 5 X0.3 mils. The length of the wire 
per bit is determined by the width of the solenoid em- 
ployed and is made the same width as the bar magnet for 
simplicity, i.e., 60 mils. 

The cross section of the tape must be adjusted to 
satisfy a number of conditions. First, the ratio of the bit 
length to tape thickness determines the demagnetizing 
field. It is desirable to keep the demagnetizing field 
small since it decreases the effective driving field. Also, 
the thickness of the tape should be kept below § mil to 
insure that the eddy current losses are not excessive. 
The amount of material determines the size of the access 
core and no more material should be included than is 
necessary to provide a detectable signal. Finally, the 
wrapped wire consists of a copper conductor used for the 
transmission of information wrapped with a magnetic 
tape used for the detection of information at particular 
locations. Since the magnetic material acts as a loading 
on the transmission line, it is desirable to keep the 
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amount of magnetic material to a minimum. The sense 
wire is a small copper wire and has an appreciable re- 
sistance per unit length. It is desirable to use a large 
diameter wire to minimize the attenuation. Unfortu- 
nately, the output signal is determined by the number 
of times per bit length that the flattened tape wraps 
around the center conductor. For a given wrapping 
pitch and bit length, the number of wraps decreases as 
the diameter increases. Thus, the output signal would 
be reduced. A compromise must be made between the 
attenuation which produces a variation of output signal 
from near bit locations to far bit locations, and the 
amplitude of the output signals. 



Fig. 7 — Helical magnetization by wrapping. The permalloy tape is 
wrapped continuously around the 3-mil copper conductor. 

The uniformity of the output signals is improved if 
the wire is magnetized in one direction. Consequently 
the wrapped wire is magnetized before the memory is 
placed in operation by the passage of a dc current down 
the wire. The permanent magnet field is directed to 
maintain the continuous magnetization of the wire. The 
drive field must switch the wrapped wire into an oppo- 
site state of magnetization. As a result, a demagnetizing 
field is created which opposes the drive field. When the 
core switch resets the bit, the demagnetization field is 
in the direction to aid the resetting of the bit. Thus, the 
wire will remain in a uniformly magnetized state. 

The Biased Core Switch 

An equivalent circuit for the biased core switch with 
a constant current input is shown in Fig. 8. The net 
current drive is the number of ampere turns of the bias 
since each of the X and Y currents provide the same 
number of ampere turns as the bias. The core can be 
represented by two circuit elements. The first is a cur- 
rent sink of 2VT 0 which represents the fact that a given 
number ampere turns must be applied to the core before 
it begins to switch. The core, during the switching op- 
eration, acts as a resistance 2? c , 4 which is proportional to 

4 E. A. Sands, “Behavior of rectangular hysteresis loop magnetic 
materials under current pulse conditions,” Proc. IRE, vol. 40, pp. 

1246-1250; October, 1952. 


the total flux of the core divided by the product of the 
switching coefficient 5 of the material in the core and the 
mean path length around the core. The core, in switch- 
ing, generates a voltage which forces current into the 
load Zl, which represents the solenoid and its sensing 
wires. In order to switch current effectively through the 
core, it is desirable that R c be made large compared to 
Zl . One possibility is to make the flux in the core large. 



Fig. 8 — An equivalent circuit for a biased core switch for a constant 
current input I. N is the ratio of input to output turns. The 
switch core is represented by the current sink NIo and the re- 
sistance R c . 

However, the total back voltage on a selection line is a 
function of the total inductance of the cores on the line. 
To reduce the inductance and hence the back voltage, 
it is desirable to keep the core cross section and thus the 
total flux small. The minimum flux must be sufficient to 
supply an output voltage to drive the required current 
into the load Zl long enough to complete the sensing. 
The resistance of the solenoid, its inductance, and the 
flux, which must be switched in each of the wrapped wire 
elements contained in the solenoid, determine the re- 
quired flux of the access core. Since the flux is equal to 
Jo^odt, a convenient nux unit is mv /xsec. The flux re- 
quired by the 26 wrapped wires is about 30 mv /xsec. 
The flux which must be supplied to drive the inductance 
with a current of 2 amp is about 50 mv /xsec while the 
flux necessary to drive the current through the resistance 
of the solenoid for 1 /xsec is 100 mv /xsec. A total of 180 
mv /x sec of flux must be supplied as a minimum by the 
access core. The cross section of the core is made suf- 
ficiently large to contain 300 mv /xsec of available flux. 
Only about 200 mv /xsec of the available flux is used. 

The access core is made of ferrite containing cadmium 
as well as manganese and magnesium. The core has an 
extremely flat hysteresis loop, Br/B s cM).93, and a very 
low coercive field, iJ c ~0.15 oersted. If a permalloy core 
were used, only a few wraps would be required to supply 
the 300 mv /xsec of flux. Tape cores with a small number 
of wraps of J or f mil tape are not as square as the fer- 
rite core. The tape core has higher dynamic resistance 
for a given flux than the ferrite core, but this factor is 
less important than the superior squareness and the 
lower cost of the ferrite core. In fact, the dynamic re- 
sistance of the ferrite core is so large that the current 

6 N. Menyuk and J. B. Goodenough, “Magnetic materials for 
digital computer components,” J. Appl. Phys., vol. 26, pp. 8-18; 
January, 1955. 
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regulation of the switch is extremely good. The biased 
core switch improves the rise time of the current de- 
livered to the load compared to the rise time of the drive 
currents. 

Summary 

A 5 12- word 26 bits per word module of a magnetic 
matrix memory has been developed for the storage of 
semipermanent information. The memofy is capable of 
random addressing at high cycling speeds. The infor- 
mation is stored in an array of permanent magnets. The 
presence of a permanent magnet is sensed by a wire 
wrapped with magnetic tape placed adjacent to the 
permanent magnet. The magnetic materials used in the 
memory are permalloy tape, Vicalloy tape, and ferrite 
cores. As used all materials are relatively insensitive to 
any change in the ambient temperature and, conse- 
quently, the memory may be operated over a wide 
temperature range. 

The electrical characteristics of the memory module 
can be compared to those of a ferrite core memory. By 
the use of four turns on the access core, the drive current 
into the magnetic switch becomes 600 ma. The back 
voltage, however, is low. The output signal at the word 
location is 8 mv. The transmission properties of the 
sense wire must be considered for large memories. The 
propagation time per bit is about 0.06 m/xsec/bit. Since 
the sense wire is resistive, the output signal may be at- 
tenuated as much as 1 db per 512-word module. Larger 
memories may be made by interconnecting several 


modules, but the number of modules which may be con- 
nected to one sense amplifier is limited by the attenua- 
tion of the wrapped wire. 

The present memory is an initial model which has 
been developed to demonstrate feasibility of the system 
as well as to meet certain operational requirements. 
Models will be made soon which have improved char- 
acteristics. In particular, the size of the memory may be 
reduced by a factor of two or more. The output signal 
from the sensed bit may be increased. The transmission 
line properties of the structure may be improved by 
making the conducting wire larger. 
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A Card Changeable Nondestructive Readout 

Twistor Store* 


J. J. DEBUSKEf, J. JANIK, 


Introduction 

W ITH the steady increase in the required speed 
and complexity of computing and data process- 
ing equipment necessary to handle today’s prob- 
lems, the role of the associated storage systems has been 
expanding. Generally, a rather large amount of perma- 
nent or semipermanent storage capacity is required to 
store information such as programs, constants, and 
tables. Besides having a large bit capacity at low bit 
cost, such stores must be fast, reliable, and flexible in 

* The work reported in this paper was done for the U. S. Depart- 
ment of Defense under Contract DA-30-069-ORD-1955. 
f Bell Telephone Labs., Inc., Whippany, N.J. 
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addressing. The twistor sensing an array of small per- 
manent magnets meets these requirements admirably. 

The “Twistor” as a memory element was conceived 
by Bobeck. 1 It may be used as either a memory or a 
sensing device. It is as a sensing device that it is used 
in the store described in this paper. The details of the 
magnetic structure are given in a companion paper. 2 

This memory matrix utilizes cards containing a space 
for a small magnet at each bit position. A magnet is 

1 A. H. Bobeck, “A new storage element suitable for large-sized 
memory arrays — the Twistor,” Bell Sys. Tech. J., vol 36, pp. 1319- 
1340; November, 1957. 

2 D. H. Looney, “A twistor matrix memory for semipermanent 
information,” this issue, p. 36. 



42 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


present if a zero is stored and absent if a one is stored. 
The model described in this paper is word-organized. 
One row across a card forms a word. These cards are 
placed over columns of twistor wires, one twistor wire 
for each bit which is used to sense the presence or ab- 
sence of a magnet for the bits of the word being ad- 
dressed. 

By combining memory cells using the cards and 
twistor wire arrays with suitable access, timing, and 
readout circuitry, a store is obtained compatible with 
high-speed data systems. Furthermore, the information 
on the cards is permanent and easy to check. The 
stored information can be readily changed by removing 
the cards and replacing them with others. 

The 512-Word, 26-Bit-per-Word, Store 

A block diagram of the store is shown on Fig. 1. In- 
formation is stored on 16 cards. Each card stores 32 26- 
bit words. Computer access to this stored information 
is via the timing and memory access circuitry. In order 
to sense the information stored in a particular word 
location, the computer must supply a “read start” signal 
and the binary coded address of the desired word. Upon 
receipt of the “read start” signal, the timing circuit 
generates the internal timing necessary to complete the 
operation. The information read out of the memory 
module is detected and amplified by the read detectors 
before being passed on to the computer at a logic level 
of —3.5 volts. 

The Twistor Memory Module 

Each of the 512 words has a word access core and a 
word solenoid associated with it, as shown on Fig. 2. A 
group of 26 twistor wires, encapsulated in a plastic tape, 
threads the word solenoids. The region along each 
twistor wire, defined by the width of the associated 
solenoid, constitutes one bit of a word. A twistor wire 
consists of a 4-79 permalloy tape, of cross section 0.005 
inch by 0.0003 inch wrapped around a 0.003-inch cop- 
per wire at an angle of about 45°. The 26 twistor wires 
are magnetized in the same direction, which is defined as 
the set state. 

To simplify the description of the operation of the 
basic memory cell in Fig. 2, only a vertical winding is 
shown threading the cores. A current “I” of sufficient 
amplitude in the vertical wire threading a word core re- 
verses the direction of the magnetic flux in the core. 
This change in flux induces a current in the solenoid 
which produces a magnetic field strong enough to re- 
verse the direction of magnetization of the sections of 
the twistor wires within the solenoids. 

A twistor bit having an associated magnet is pre- 
vented from switching its direction of magnetization by 
the presence of the external magnetic field of the magnet 
(bit 25), (Fig. 2). This field is in the same direction as 
the set state of magnetization of the twistor wire. The 
rate of change of flux in a twistor bit that is switched in- 
duces a voltage in the twistor wire. The passage of cur- 
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Fig. 1 — Basic block diagram of the store. 
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rent in the opposite direction through the word access 
core resets the core, which in turn restores the switched 
twistor bits to their set state. Therefore, in this applica- 
tion, the twistor elements are used to sense the presence 
or absence of permanent magnets. The presence of a 
permanent magnet represents a stored zero, and the 
absence of a magnet represents a stored one. 

The 512 core-solenoid combinations are arranged and 
wired as a biased core switch 3 in a 32 by 16 matrix. (See 
Fig. 3.) Note that the 26 tape-encapsulated twistor 
wires are in one continuous folded belt. A vertical drive 
winding threads each core of a particular column with 
four turns. A horizontal drive winding threads each core 
of a particular row with four turns. The vertical and 

3 J. A. Rajchman, “A myriabit magnetic-core matrix memory,” 
Proc. IRE, vol. 41, pp. 1407-1421; October, 1953. 
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horizontal drive currents and windings produce aiding 
flux at a cross point in each core. A constant current 
bias winding threads each core of the matrix. The coin- 
cidence of currents in the horizontal and vertical drive 
windings of a word core overcomes the effect of the bias, 
and thus switches the core resulting in the readout of 
the information stored in that word location. The 
switched bits are restored to their former states by the 
action of the bias current which resets the word access 
core when the horizontal and vertical drives are re- 
moved. In order to select word 1, Fig. 3, it is necessary to 
supply pulse currents, of sufficient amplitude and dura- 
tion, on horizontal drive winding “a” and vertical drive 
winding “1”. 

With horizontal and vertical drive currents of 0.6 
ampere into 4 turns each, and a bias current of 2.25 
amperes, the twistor output for a stored 1 is approxi- 
mately 8 mv and about 2 mv for a stored zero. When a 
word access core is switched, the other 31 cores of the 
row and 15 cores of the column are shuttled by the 
drive currents. The effect of these shuttle currents and 
inductive pick-up noise is reduced by staggering the 
time of application of the drive currents and by strobing 
the read detector outputs. 

In the present 512-word memory module, the trans- 
mission properties of the twistor circuit produce an out- 
put signal attenuation of as much as one db and a prop- 
agation time of about 35 m/xsec. In order not to penalize 
the signal-to-noise and timing of the words most remote 
from the read detectors, two modules of the present 
type may be read with a single set of read detectors. 


The Application of the Memory Module 
to the Store 

A diode, WECo type 1N2146, is added in each hori- 
zontal and vertical drive lead to block “sneak” paths. 
The leads are interconnected as indicated on Fig. 4. 
This arrangement permits the selection of one of the 16 
horizontal drive leads by a 4 by 4 selection switch and 
the selection of one of the 32 vertical drive leads by an 8 
by 4 selection switch. The selection switches and the de- 
coders comprise the memory access circuits. (See Fig. 
5.) Under the control of the binary address supplied by 
the computer, the decoders select the horizontal and 
vertical drivers required for access to a particular word. 
The selected drivers supply the necessary drive currents 
under the control of the timing circuit. 

The outputs of the twistor wires are amplified by the 
read detectors to logic level ( — 3.5 volts) and passed on 
to the computer during the strobing interval. The strobe 
gates of the read detectors are controlled by the timing 
circuit. 

When the timing circuit receives a “read start” signal 
from the computer, it generates the horizontal drive, 
vertical drive, and strobe signals, as shown in Fig. 6. 
Note that the vertical drive signal occurs approximately 
0.5 jusec after the horizontal drive signal. The selected 
drive currents flow in the word-access core matrix dur- 
ing the interval the drive signals are applied. The sole- 
noid current and the twistor output signal are shown on 
the figure for reference. The strobe gates of the read 
detectors are activated during the strobe interval of 0.2 
nsec commencing approximately 1.4 /x sec after the “read 
start” signal. 

The function of the initial magnetizing circuit (Fig. 
5) is to insure the uniform magnetization of the twistor 
wires in the same direction. This is accomplished by 
passing a 10-/xsec, 200-ma current pulse through the 
twistor wires, and must be done before the store is 
placed in operation or whenever the program cards are 
installed or replaced in the memory. This circuit is op- 
erated by a switch located on the control console of the 
computer. 

Decoder 

The decoding of the address is done in “double-rail” 
logic. To address the 512-word store, 9 binary address 
bits and their complements are required, these being di- 
vided into 4 groups. Three bits and their complements 
are used to select one of the 8 top vertical drivers \Fig. 
5) ; 2 bits and their complements are used to select one 
of the 4 bottom vertical drivers; 2 bits and their com- 
plements are used to select 1 of the 4 right horizontal 
drivers; and 2 bits and their complements are used to 
select 1 of the 4 left horizontal drivers. The address bits 
from the computer are the outputs of the flip-flops of 
the address register and appear as steady ground or 
— 3.5 volt signals. The address bits are assigned so 
that the first 32 words will appear in order from 1-32 on 
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Fig. 4 — Interconnection of drive leads. 




HORIZONTAL DRIVE 

Fig. 7 — Decoder logic diagram. 


zontal drive signal. Similar logic is used in the vertical 
for address decoding. 

Drivers 


Fig. 5 — Complete store block diagram. 

the first program card in the first slot, and the follow- 
ing words through 512 will appear consecutively in the 
adjacent slots. 

One of the 2 horizontal decoders is illustrated in the 
logic diagram of Fig. 7. The inverters are provided for 
fan-out purposes. The 2 bits and their complements are 
combined with the horizontal drive signal (in “and” 
circuits) to produce a l-out-of-4 selection. The selected 
lead remains at —3.5 volts for the duration of the hori- 


The horizontal and vertical drive currents are each 
controlled by a pair of drivers capable of providing an 
0.6-ampere pulse with a 0.5-jusec rise time. A vertical 
winding (4 turns shown as one wire) with drivers and 
cores is shown on Fig. 8. The bottom driver of the pair 
has a current monitoring resistor R8 and a feedback 
network composed of diode CR1 and resistors R7 and 
R8. The feedback network monitors, and thereby regu- 
lates, the drive current of 0.6 ampere. Transistor Q1 is a 
WECo type 2N560, and transistors Q 2 and Q3 are 
WECo type 2N1072. 
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Read Detector 

The read detector schematic is shown in Fig. 9. 
Transistors Q1 and Q2, WECo type 2N559, comprise a 
stabilized feedback linear amplifier 4 whose gain is given 
to a good approximation by 

H 21 = 1 T R3/R4. 

Transformer T1 is used to match the impedance of the 
twistor circuit to the input impedance of the amplifier 
stage. The third stage, transistor Q3, is normally held in 
a conducting condition by a current from the minus 
voltage supply through resistor R7. The signals at the 
collector of Q2 are of such a polarity as to tend to cut 
off the collector current of transistor Q3. A thresholding 
function is obtained because an output signal (one) 
from the twistor, amplified by Q1 and Q2, must over- 
come the base bias of Q3. The collector circuit of tran- 
sistor Q3 is designed to be compatible with the connect- 
ing logic circuitry of the computer. Transistor Q4 is 
used to “strobe” the output of transistor Q3. This pro- 
vides discrimination against signals coming from the 
memory except when the output signal is expected. 
Clearly, both a twistor output signal and the strobe sig- 
nal must be present simultaneously or no output results. 
Both Q3 and Q4 are also 2N559’s. 

Timing Circuit 

All of the timing for the store is generated within the 
store. The “read start” signal from the computer is the 

4 F. D. Waldhauer, “Wide-band feedback amplifiers,” IRE 
Trans, on Circuit Theory, vol. CT-4, pp. 178-190; September, 

1957. 
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Fig. 10 — Timing circuit logic diagram. 


only externally generated signal and is used to initiate a 
series of univibrators (U). (See Fig. 10.) The univibrator 
is basically a 1-shot multivibrator whose pulse width is 
determined by an RC time constant. One univibrator is 
used to generate the 3.0-/isec horizontal drive pulse 
(Fig. 6) Two univibrators are used to generate the 
vertical drive pulse, one univibrator to provide the 
0.5-;usec delay and the other to determine the pulse 
width. Similarly, 2 univibrators are used to generate the 
strobe signal. 

512-Word Store (Mechanical Construction) 

The store is assembled with its twistor memory mod- 
ule, decoders, drivers, timing circuit, and read detectors 
as one package. (See Fig. 11.) The weight of the store is 
70 pounds and its over-all dimensions are 9 by 21 by 
22 inches excluding power supplies. 
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Fig. 11—512-word store (mechanical construction). 


The memory module is located toward the front of 
the package for ease of card changing. The read de- 
tectors are located immediately to the rear of the 
memory to keep the low-level signal leads as short as 
possible. Two partial planes are used to mount the read 
detectors. Below these are mounted the memory 
module, timing circuit, and diode circuits. The bottom 
plane is reserved for the decoders and drive units. 

The entire store can be disassembled by separating 
the individual planes. Connectors at the front and rear 
of each of the planes carry signals upward and down- 
ward between planes. Computer access is outward 
through rear connectors. 

The store contains 200 2N559 diffused base switching 
transistors, 20 2N560 diffused silicon switching tran- 
sistors, 40 2N1072 silicon, high current, switching tran- 
sistors, and 48 1N2146 silicon diodes. Approximately 50 
watts of power is required for the operation of the store. 
This power is supplied by 3 direct current supplies, — 12 
volts, +12 volts, and +32 volts. 

Program Card 

The program card consists of 32 rows of 26 or less 
Vicalloy magnets per row. (See Fig. 12.) The magnets 
are 0.060 inch long, 0.020 inch wide, and 0.002 inch 
thick. Two types of cards are used because the twistor 
tape folds back and forth through the memory and the 
twistor wires are uniformly magnetized in one direction. 
Red cards are used in the even slots and are magnetized 
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Fig. 12— Program card. 

left to right; green cards are used in the odd slots and are 
magnetized right to left. 

Each card has a tab on which the programmer marks 
the card number and the program symbol when the 
card is prepared. Lines are provided on the card separat- 
ing the 26 bits of each word into five 5-bit columns and 
one 1-bit column to assist the programmer. Further, 
each word ^numbered for his use. 

Summary 

Information is stored on removable cards which can 
be prepared quickly by stenographic personnel if de- 
sired. Twistor elements are utilized to sense the informa- 
tion which is in the form of tiny permanent magnets. 
This store uses a basic 512-word twistor memory module 
and is packaged with all of its associated solid-state cir- 
cuitry as one integrated and removable unit. It is op- 
erated on a 5-fj.sec cycle time and requires approxi- 
mately 50 watts of power. A 2048-word, 26-bit-per- 
word, store is under development; in principle it op- 
erates in the same manner as the 512-word store, except 
that four twistor memory modules are wired together. 
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Square-Loop Magnetic Logic Circuits 

EDWARD P. STABLER! 


Introduction 

F ERRITES, ferroelectric capacitors, and some fer- 
romagnetic materials all possess bulk character- 
istics which are usually referred to as square-loop 
properties. It has been recognized for some time that 
these properties are particularly compatible with digi- 
tal-device requirements. This paper attempts to cate- 
gorize the various methods which are utilized in device 
synthesis. In addition, it introduces an equivalent cir- 
cuit model of the devices to determine some of their 
attributes and limitations. The model is essentially a 
nonlinear resistive and reactive network whose parame- 
ters are determined by the internal state of the device 
and by its present state of excitation. For the sake of 
simplicity, the discussion will be limited to magnetic 
devices. Other authors have pointed out that a direct 
ferroelectric equivalent exists for multiple-aperture 
magnetic devices. 1 

Significant Properties of Network Elements 

The significant properties of an element of square- 
loop material are threshold, memory, and saturation. 
These properties are described more precisely below. 
Some rather gross assumptions are made; however, a 
more refined description does not seem warranted be- 
cause of the increased analytical difficulty. The basic 
magnetic element, shown in Fig. 1, is a cylinder whose 

i * LENGTH 

A = CROSS SECTIONAL AREA 

M « J H*dt 

Fig. 1 — Elementary magnetic element. 

length is considerably larger than either of its other di- 
mensions. The properties of the element are described 
in terms of the terminal magnetic variables. The cylin- 
der is a two-terminal circuit element. The magneto- 
motive force M between the two terminals is a function 
of the flux passing through any cross section. In this 
approximation, the magnetic field is assumed to be con- 
stant over the length of the element. 

| General Electric Co., Syracuse, N. Y. 

1 T. E. Bray and B. Silverman, “Shaping magnetic and dielectric 
hysteresis loops,” Proc. Special Tech. Conf. on Solid-State Dielectric 
and Magnetic Devices, Catholic University of America, Washington, 

D. C; April, 1957. 



Threshold Property 

For values of M below a threshold value Mo, the 
terminal relationship is 



dM 

dt 


However if M exceeds Mo, switching may take place. 
The terminal variable relationship is 

d(f> dM 

— = Kx — ■ + K 2 (M - Mo). 

dt dt 

The first term represents the reversible flux change 
and the second term represents the irreversible flux 
change during switching. The element has a negative 
threshold as well as a positive one. If M is negative and 
has a magnitude greater than Mo, switching may also 
occur, hence 

d<j> dM 

— = K x b K 2 (M + Mo). 

dt dt 


Memory Property 

The change in <f> during switching is the sum of two 
terms. The first term, proportional to Kx, is a reversible 
term. It represents the lossless linear flux change 
caused by an applied magnetomotive force. The second 
term, proportional to K 2 , is an irreversible change. A 
net change in <f> proportional to K 2 will occur during 
switching. The implication is that d> has many stable 
values for zero applied magnetomotive force. The 
memory of the element is associated with this property. 
We can define an internal state S, which is related to 
the flux <f>, in the element with zero applied field as 

5 = Ko<p with M = 0. 

Choose Ko so that the magnitude of S never exceeds 
unity. 


Saturation Property 


The total irreversible flux change that can take place 
is limited by saturation. When the material is saturated 
by a positive drive, the terminal relationship is 


dcj> dM 

— = Kx 

dt dt 


for any applied magnetomotive force greater than — Mo. 
In negative saturation a similar expression is obtained: 



dM 

dt 



48 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


for all M less than -{-Mo. Saturation is related to the 
irreversible flux changes which have taken place. We 
have chosen the constant Ko so that 

5 = + 1 for positive saturation 


and 


S — — 1 for negative saturation. 


Now we can summarize the complete set of terminal 
relationships. 

1) We define S = K 0 (tp — K x M), where we have sub- 
tracted the reversible flux contribution caused by 
an applied mmf. 


2 ) 

3) 

4) 

5) 

6 ) 


d(f> 

dt 

dtp 

dt 

dtp 

dt 

dcj> 

dt 

d<p 

dt 



dM 


Kx 


for M < | M 0 ] , 


dt 



dM 

r s = i i 

Kx 


for 


dt 

L M > - M 0 J 


dM 

rs = - n 

Kx 


for , 


dt 

L M < MoJ 


dM 

s r\s\ 3* n 

Kx 


+ Kz(M — M 0 ) for , 


dt 

LM > MoJ 


dM 

r r 1 *5*1 ' 

Kx 


+ Kz(M + Mo) for 


dt 

LM < — Mo. 


The constants Kx, Kz, K 0 , and M 0 depend on the bulk 
material and the dimensions of the element. .Mo is 
usually a function of S, but this fact will be neglected 
here. 

A fourth property implicit in the above relationships 
is symmetry. If we conduct an experiment starting from 
state S, applying a drive M(t), we observe the response 
<p{t). This experimental result implies that an experi- 
ment starting at —S, consisting of the application of 
— M(t), will have the result 

All of the terminal relationships listed are linear dif- 
ferential equations. The magnetic element is nonlinear 
because the applicable differential equation depends on 
the applied drive and the internal state. The next sec- 
tion introduces a nonlinear electrical circuit equivalent 
to the magnetic circuit element. 


Equivalent Electrical Circuit 

The equivalent electrical circuit will consist of linear 
electrical components and switches which are actuated 
at the turning points 

I S\ = 1 


or 

| M | = Mo. 

We choose to relate 


dtp 

/~N»/ electrical current, 

dt 


The constraints on dtp/dt and M, imposed when several 
elements form a magnetic network, are exactly the 
constraints on i and e in electrical networks. In Fig. 2 
we show three two-terminal electrical networks for 
which the relationship of the terminal variables e, i is 
identical to the relationship between M and d<p/dt. The 
equivalent electrical network that should be used is a 
function of the internal state S and the applied voltage. 

A capacitor appears in the equivalent circuit because 
we are forming the analog of the magnetic variable re- 
lationship. This relationship is different from that ob- 
served at the electrical terminals of a winding on the 
magnetic element. 

If an electrical circuit, such as a drive winding or an 
output winding, is coupled to an elementary magnetic 
element, the relationship between the terminal variables 
M and dtp/dt is modified. Fig. 3(a) shows a magnetic 
element with a coupled electrical circuit which has been 
reduced to its Norton equivalent circuit. The previ- 
ously derived circuit is appropriately modified by the 
addition of a series impedance numerically equal to 
Y e N 2 and a series voltage source numerically equal to 
NI e . Modification of the magnetic-network-element 
parameter relationships by means of coupling to elec- 
trical circuitry is useful in some synthesis problems. 

When two or more magnetic elements are coupled by 
the same electrical circuit, an extension of the preceding 
technique is used to obtain an equivalent circuit. This 
procedure is not always desirable because there is no 
longer a close relationship between the graph of the de- 
rived equivalent circuit and the geometry of the mag- 
netic device. In such cases, it is preferable to solve the 
network using a mixed (magnetic and electrical) set of 
independent variables. 


Physical Interpretation 

It is reasonable to expect that the equivalent circuit 
parameters can be determined from the dimensions of 
the element and the properties of the bulk material. 
From the magnetic material properties, we obtain: 

B s = saturation flux density, 

Ho = threshold magnetic field, 

S w = switching constant, 
ju = small signal permeability. 

The magnetic element has a length l, and a cross- 
sectional area A. We solve for the circuit parameters: 


Ko 


K z 


1 

B S A 

2B g AS w 

l 


K x 


nA 

l 


M ~ e voltage. 


Mo - Hoi 
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USE 

WHEN 




S=l AND 

e> - Eo 

d(|) 

OR 

S=-l AND 

e> -Eo 

dt 

OR 

|s|*l AND 

Eo 


USE 
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. n 
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Fig. 3 — Modified equivalent circuits for magnetic element coupled 
to an electrical circuit. 


Fig. 2 — Equivalent electrical circuits. 


In practice, these values are reasonably accurate with 
the exception of the solution for K x . K x represents the 
reversible flux-change term and depends quite notice- 
ably on dimensional relationships other than those 
mentioned, upon the electrical winding configuration 
and the internal state S. One contributing reason for 
this effect is that the small signal permeability of the 
materials used is often less than two orders of magnitude 
greater than that of air. It has been assumed that the 
flux entering or leaving the magnetic element through 
its walls was negligibly small compared to the amount 
of flux entering or leaving the ends of the element. 
Careful design is necessary if this assumption is to be 
a useful one. 

Although the reactive nature of the equivalent circuit 
plays an important part in the analysis of a given device, 
it does not have the nonlinear property utilized in the 
digital-device synthesis. For this reason only the non- 
linear resistive portion of the equivalent circuit will be 
used during the discussion of synthesis techniques. As 
a result, the network will consist of linear resistors, bat- 
teries, and switches which provide the nonlinear charac- 
teristic. In addition, an internal state of 5= + 1 will be 
indicated by an arrowhead in the usual way. Elements 
in other internal states will have no arrowhead notation. 
Once a network has been synthesized in graph form, it 
is essential to perform a thorough analysis to fix the 
optimum geometric ratios and winding configurations. 
Fig. 4 shows the simplified equivalent circuit and no- 
tation. 


Logical Function Synthesis 

One method of synthesis may be called flux steering. 
In this type of operation, the device is put into an ag- 
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Fig. 4 — (a) Elementary unit, (b) simplified equivalent 
circuits, (c) graphical notation. 


gregate internal state dependent on the input binary 
variables. For each internal state, the output binary 
function is either equal to unity or equal to zero. When 
the internal state corresponds to a unity output, the 
device will respond to a drive D by having a large flux 
change all along a closed loop L called the output loop. 
The output function is zero when the response to the 
drive D is different from that previously described. 
Either the path along which switching takes place 
(along which d<t>/dt is large) will differ from L, or there 
may be no switching at all. Any Boolean function may 
be written in canonical form as the conjunction of dis- 
junctive polynomials or, alternatively, as the disjunc- 
tion of conjunctive polynomials. We choose the former 
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representation to show a direct device synthesis. As an 
example, let 

g = f(Xi, X 2 , X 3 , X 4 ) 

g = (Xi + X 2 )(X 3 + X 4 )(Xi + X 3 ) 

where g is a binary function of the four binary-input 
variables. 

A device which will generate to this function is a 
three-input AND gate as shown graphically in Fig. 5. 
It is shown initially in its rest state prior to the recep- 
tion of input drives. The closed path formed by h, l 2 , k, 
is the output loop L. The control legs l c i, l c 2 , and / c3 may 
be switched during the input period by drive windings 
controlled by the input variables X x , X 2 , X 3 , X 4 placed 
on the control legs. Fig. 6 shows the approximate 
equivalent circuit of the device during an input period 
in which 

X x = 1 
X 2 = 1 
X 3 = 1 
X 4 = 0. 

This input signal will cause all three elements forming 
the output loop L to switch. The device may be designed 
so that the control legs saturate, or reach S — — 1, at 
the same time that the controlled legs h, h , k reach 
S = — 1. A subsequent drive tending to switch the out- 
put loop L will switch each element of the output loop. 
If the input state had been 

X x = 1 
X 2 = 1 
X 3 = 0 
X 4 = 0, 

then leg l 2 would not have been switched during the in- 
put period. Consequently, Z 2 would not switch during 
the output drive. There are a number of variations of 
this operation which tend to improve its characteristics. 
The example given, however, sufficiently illustrates the 
principle of operation of a class of gates called flux- 
steering gates. We restate this principle for emphasis. 

In a flux-steering gate the internal states of a number 
of elements in an output loop L are controlled by a 
number of control elements. Switching takes place in 
every element in L during the output period if, and 
only if, the output binary function is equal to unity. 

It is clear from the discussion that theoretically a 
single flux-steering gate can generate any combinational 
logical function. 

Gates of this type have been constructed at the 
> General Electric Electronics Laboratory. Reasonable 
power gain is available. Fig. 7 shows photographs of 
the output signal of a two-input flux-steering gate for 
unloaded and loaded operation. 



Fig. 5 — Graph of three-input AND gate in initial stage. 



E c 3 r c 3 

Fig. 6 — Equivalent circuit of three-input AND gate 
during input period. 


A second method of synthesis is named flux summa- 
tion. 2 Again the function is written as the conjunction 
of a number of disjunctive polynomials. df the Boolean 
function is the conjunction of n disjunctive polynomials, 
the device will have n input elements, n— 1 shunt ele- 
ments, and one output element. The device has a rest 
state which precedes any input time period. The geome- 
try is such that switching takes place in the output ele- 
ment only when all the input elements are switched 
during the input period. As an example, we will syn- 
thesize the function 

g = (X x + X 2 )(X 3 + X 4 )(Xi + X 3 ). 

A graph of the device in its rest state is shown in Fig. 8. 
Elements h, h , and / 3 are input elements; elements h and 
k are shunt elements which may be combined if desired; 
and element k is the output element. All elements have 
the same cross-sectional area. During the input period, 
elements / x , l 2 , and k may be driven and switched. The 
element k has a high threshold because of its additional 

2 N. F. Lockhart, “Logic by ordered flux changes in multipath 
ferrite cores,” 1958 IRE National Convention Record, pt. 4, pp. 
268-278. 
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Fig. 7 — Flux-steering gate output. Scale: 0.05 v per turn per division 
(vertical) and 0.2 /xsec per division (horizontal). Top: unloaded; 
bottom: loaded. 



length. As a result, if two or less of the input elements 
are switched, the output element will not switch. The 
output element will switch only if all three of the input 
elements are switched. This results from the fact that 
the two shunt legs will saturate when any two of the 
input legs switch under the influence of input drives. 
If all three input elements are switched, the two shunt 
legs become saturated before the input elements satu- 
rate and cannot undergo further large flux changes. 
Switching in the output element ensues. The threshold 
M 0 of the output element is made considerably greater 
than that of the two shunt elements. 

It is obvious that the device forms a three-input 
AND gate as shown. Multiple windings on the control 
elements perform the disjunction operation. On element 
1, two windings are placed; one is excited whenever 
Xi = l, and the other is excited whenever AT 2 = 1. The 
excitation provided by either winding is sufficient to 
switch element 1. The other control elements are wound 
in the same way. Various means are available for sensing 
the output element. The shunt elements may be com- 
bined into a single element or may be used to generate 
other functions of the input variables. This type of gate 
is only briefly discussed here but is covered thoroughly 


by Lockhart. 2 A single gate of this type can generate 
theoretically any combinational logical function. The 
principle of this gate is restated for emphasis. In a flux- 
summation gate, switching of an output element is con- 
trolled by the presence of shunt elements in parallel with 
the output element. During the input period, the switch- 
ing of input elements will cause switching in the shunt 
elements. Switching will occur in the output element 
only if the shunt elements saturate before the input ele- 
ments saturate. Actually this description has been re- 
stricted in order to emphasize the principle. 

A third type of logical-function synthesis is the relay 
analog method. It is related to the flux-steering gates 
but is sufficiently different to warrant separate treat- 
ment. The graphs of the relay analog unit in its two 
possible input states are shown in Fig. 9. The closed loop 
formed by / 0 , h, and l c is capable of storing information 
in the same way as a magnetic-memory core, by satura- 
tion of the three elements in a clockwise or counter- 
clockwise direction. All three elements have the same 
cross-sectional area, and normally l c has a higher 
threshold than either l\ or l 0 . The closed loop is initially 
saturated in one direction or the other by means of an 
input variable X. Suppose X = 1 corresponds to counter- 
clockwise saturation. Now the element is read out by 
applying one mmf, M, as shown in Fig. 10. As a result 
of M, a flux change will take place in Z t (the input lead) 
and in Zi (the unity output lead). Little or no irreversi- 
ble flux change will take place in Z 0 (the zero-output 
lead). It is clear that these devices can be cascaded to 
form a wide variety of functions. A symmetric tree of 
three input variables is shown in Fig. 11, along with the 
relay equivalent. The output leads are Z 0 , Z\, Z 2 , Z 3 
where the subscript refers to the number of input vari- 
ables equal to unity. Fig. 1 1 shows the internal state of 
the device when X\ = 0, X 2 = 1 , X$ — 1 . Theoretically, de- 
vices of this type may be cascaded to generate any com- 
binational function. 

Relay analog elements have been built at the Elec- 
tronics Laboratory. The ratio of the irreversible flux 
changes which take place in the two output paths, Z\ 
and Z 0 , is about 15:1 for a single stage. Fig. 12 is a 
photograph of dxp/dt for the two paths. 

We have described three different techniques of syn- 
thesizing a gate to generate a combinational logical 
function. The problems associated with the input and 
output circuitry have not been considered. There are 
a number of different types of circuitry which may be 
used to couple the magnetic gates. 3-7 

3 A. Wang, “Magnetic delay line storage,” Proc. IRE, vol. 39, 
pp. 401-407; April, 1951. 

4 R. D. Kodis, et at., “Magnetic shift register using one core per 
bit,” 1953 IRE Convention Record, pt. 7, pp. 38-42. 

6 V. Newhouse and N. Prywes, “High-speed shift registers using 
one core per bit,” IRE Trans, on Electronic Computers, vol. 
EC-5, pp. 114-120; September, 1956. 

6 D. Loev, et at., “Magnetic core circuits for digital data-process- 
ing systems,” Proc. IRE, vol. 44, pp. 154-162; February, 1956. 

7 M. Karnaugh, “Pulse-switching circuits using magnetic cores,” 
Proc. IRE, vol. 43, pp. 570-584; May, 1955. 
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Fig. 9 — (a)XRelay analog unit, (b) relay analog unit storing a “1,” 
(c) relay analog unit storing a “0”. 




Fig. 12 — Signal and noise responses of the relay analog unit. 
Time scale: 1 ;usec/cm. 


Fig. 10 — Output period drives applied to relay analog unit. 



(a) 



Fig. 11 — (a) Relay analog symmetric tree, 
(b) relay symmetric tree. 


In general, all the techniques successfully used in 
conventional-core logic may be used for multiple-aper- 
ture gates. In the next section, we discuss another tech- 
nique of interconnection fundamentally different from 
core logic output circuitry and unique to multiple-path 
logic gates. 


Magnetic Interconnection 

The conventional magnetic-core logic interconnection 
circuitry is used to force switching of the input leg of one 
gate when switching occurs in the output leg of another 
gate. Since both these events are merely flux changes, 
the possibility of coupling the two entirely within the 
magnetic medium seems promising. The magnetic- 
interconnection network should permit unambiguous 
signal propagation and should not adversely affect the 
operation of the units which it couples. 

Fig. 13(a) shows a graph of a magnetic-network ele- 
ment with properties similar to an electrical diode. The 
approximate terminal characteristics are given in Fig. 
13(b). Leg h is saturated as shown. Leg l r has a much 
higher threshold than l d , and a much higher cross- 
sectional area and saturation flux. The element is shown 
in its initial state. The diode element will be much more 
responsive to a positive-applied M than to a negative- 
applied M. Information can be transmitted from left 
to right, but not in the reverse direction. If the diode 
unit is used to transmit information, switching may take 
place in Id. As a result, it will no longer have the desired 
diode characteristic and must be reset. A drive applied 
to l r forces h back to its original saturation state. These 
diode units have the required properties for direct inter- 
connection of two magnetic elements. The next para- 
graph demonstrates their utility by describing a digital- 
delay unit in which the storage locations are coupled 
through magnetic diodes. It is important to note that 
repeated transmission of information in a single direc- 
tion will lead to saturation of l r so that h cannot be 
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Fig. 13 — Magnetic diode terminal characteristics. 


reset. In any specific design, techniques can be found 
to prevent this occurrence. Also, other types of coupling 
networks can be used in which this problem does not 
arise. The diode described above is only intended to 
indicate an approach to the problem. 

Digital- Delay Component 

Fig. 14 illustrates a section of a digital-delay device 
which uses magnetic diodes to couple information- 
storage units. In order to advance the stored "1” to the 
right, two pulse drives occur in sequence. First, all odd- 
numbered vertical legs are driven upwards. The diodes 
prevent information flow to the left. Only the storage 
location storing a “ 1 ” will switch. For the state shown 
in Fig. 14, h, and h will switch. The next pulse resets 
the diodes. The final state is shown in Fig. 14(c). The 
stored “1” has moved to the right. The next advance is 
caused by a pulse drive applied to all the even-numbered 
vertical legs, followed by a diode reset drive. We have 
described the digital-delay unit only briefly because we 
intend to illustrate the possibilities of the synthesis ap- 
proach rather than to describe a practical device. 

The magnetic-diode element may also be used to 
couple the output leg of a gate to an input leg of 
another gate. It is possible to construct fairly compli- 
cated logical machines in which the information is 
propagated entirely within the magnetic medium. 

All-Core Networks 

Networks built entirely of simple cores and copper 
windings exist which are equivalent to any of the 
multiple-aperture devices which have been described. 
A brief introduction to the subject is given here. 

When magnetic elements are combined in a network 
to form a multiple-aperture device, nodal constraints 
are imposed on the flux levels in the elements. At a node 
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Fig. 14 — Digital-delay element. 

£<*> = o 

and 



If windings on several different cores are connected in 
series to form a short-circuited loop. 

for the series loop. If there are no other impedances, all 
the voltages in the loop are induced voltages 

d(j> 

e = N — • 
dt 

The flux changes which take place in the cores coupled 
by the series loop obey the following equation 


This constraint is similar to the nodal constraint of 
magnetic networks. Following this procedure, a core 
circuit equivalent to any multiple-aperture device can 
be found. The relative advantages of the two classes of 
circuitry will be discussed in a subsequent paper. 

Conclusions 

We have described an elementary magnetic element. 
This can be used as a building block for any multiple- 
aperture device. We have also presented a terminal 
variable relationship of the element, and introduced a 
nonlinear electrical equivalent circuit which may be 
used to analyze the operation of a multiple-aperture 
device. Three synthesis techniques have been given for 
obtaining a multiple-aperture device to generate any 
combinational logical function. Such devices may be 
coupled by conventional-core logic circuitry. 

The possibility of coupling logical gates entirely 
within the magnetic medium has been discussed briefly. 
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A simple example of a magnetic “diode” unit was used to 
demonstrate that square loop materials can be used in 
interconnection circuitry. If this procedure is followed, 
many logical operations may be performed on the in- 
put formation while the information is kept continu- 
ously within the magnetic material. 

All-core networks may be derived which are equiva- 
lent to the multiple-aperture devices. The networks 
which result have many advantages over their multihole 
counterparts. These networks will be discussed in a 
future publication. 

Future Prospects 

Coupling of logical gate devices within the magnetic 
medium has many attractions. 


1) Compatibility — the physical input quantities are 
of the same magnetic nature as the physical out- 
puts. Direct coupling permits scaling down physi- 
cal dimensions to reduce the power consumption. 

2) Resistance to noise — the magnetic elements are 
natural integrating devices. In a noisy environ- 
ment they will not respond to large but short-lived 
noise impulses. 

3) Reliability — the reliability of magnetic materials 
is seldom questioned. The circuits described here 
contain no electrical components except for a 
clock pulse source and drive windings. 

Future effort should be devoted' to developing simple, 
flexible magnetic-interconnection circuitry suitable for 
coupling the various types of logical gates. 


Relative Merits of General and Special Purpose 
Computers for Information Retrieval 

A. OPLERf and N. BAIRDf 


Introduction 

I NCREASING attention to automatic information 
processing is being given by all sections of our 
■ technology, commerce, and military operations. 
One of its more important aspects is the use of comput- 
ing machines for storage and subsequent retrieval of 
information by request. 

There have been two simultaneous patterns evolving 
in the last decade. One group has borrowed the equip- 
ment used for standard accounting and engineering 
calculations and has demonstrated the practicability of 
automatic information retrieval (IR) using such equip- 
ment. A second group has concentrated on designing 
special equipment for use solely for mechanized informa- 
tion retrieval. 

We are at present in a transition period and it seems 
appropriate at this time to review the progress made 
using each of the two approaches and perhaps to intro- 
duce some constructive cross-fertilization. For the most 
part, this field, like many new and exciting disciplines, 
has produced much controversy, strong prejudices, and 
a tendency to place personal viewpoint above impartial 
analysis. Let us hope this “feudal” period is over. 

Information Retrieval on General 
Purpose Computers 

Excluding military systems, there have been approxi- 
mately eighteen information retrieval systems pro- 

f Computer Usage Co., New York, N. Y. 


grammed and debugged for general purpose (GP) com- 
puters. Some of these were written primarily for pur- 
poses of exploration and others have been actually made 
operational. From a review of this activity, we are now 
able to draw a number of generalizations regarding the 
performance of general purpose computers in the infor- 
mation retrieval area. 

1) The high-speed computer has proven satisfactory 
for both exploration and operation. Intermediate 
speed machines have been satisfactory only for 
exploration, for operation of simple searching 
schemes or use with small collections. 

2) The files of information to be stored were main- 
tained on magnetic tape (one exception employed 
magnetic disk storage). 

3) The full gamut of available machines has been 
used for information retrieval and it appears that 
no currently available logical design is markedly 
superior to any other. Information retrieval sys- 
tems tend to take many forms. For each IR sys- 
tem formulation and for each machine design, 
there will be special programming techniques 
required. 

4) A remarkable variety of document storage for- 
mats and retrieval schemes has emerged. With 
the general purpose computers, it appears that 
the searching system designer is relatively free to 
build using the classification system best suited 
to his needs. Indexing schemes as simple as Dewey 
Decimal and as complex as those required for 
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describing the details of the kinetic processes in- 
volved in the synthesis of pharmaceuticals have 
been programmed with little real difficulty. 

5) Most computer searching systems are designed 
to input the search requirements in formats suit- 
able for use by personnel unfamiliar with com- 
puters and to produce outputs designed for a 
similar group. 

6) Cost of development of such computer systems 
has been very high and the absence of a suitable 
information processing language compiler has 
been sorely felt. 

7) Once a system is developed, the actual searching 
and file maintenance costs have proved to be 
moderate. 

8) The most common technique is that of searching 
a magnetic tape file from beginning to end while 
seeking answers to more than one question (mul- 
tiplexing). The searching speeds so obtained have 
proven better than first anticipated. Common 
speeds range from 1000-15,000 document inter- 
rogations per question per minute. As each new 
program is written, valuable experience is ac- 
cumulating regarding the best programming and 
storage techniques to use. 

9) Document collections thus far used have ranged 
from 1000-50,000 items. In every case, these 
collections have been of “more than average” 
value to the sponsoring organization. Typical 
systems search important collections of patents, 
developmental chemicals, and reports especially 
pertinent to the sponsor’s area of interest. No 
one has yet been willing to expend funds to index, 
code, and store the complete contents of news- 
papers, encyclopedias, or even technical journals. 

10) In no case has the acquisition of a large computer 
solely for information retrieval been recom- 
mended or even suggested. Both the develop- 
mental and operational searching systems share 
machine time with other technical operations. 

Special Purpose Information Retrieval 
Relay Machines 

Compared to these rather imposing accomplish- 
ments, the work done to date by the special purpose 
retrieval machines has seemed less spectacular. During 
the last five years, the use of the GP electronic computer 
has been explored while the special purpose machines 
developed have been primarily electromechanical. Spe- 
cial purpose electronic machines are about to appear. It 
will probably be some time before we can obtain as 
much perspective here as we have on the general pur- 
pose electronic computers. 

The first group of special purpose devices select from 
a collection of separable records (punched hole, optical 
film, and magnetic cards). In each of these devices, the 
file items are passed through a sensing device which an- 
alyzes the code stored on each card. The search criteria 
are established either by wiring patch boards, inserting 
special request cards, or by other means. The selected 


cards may then be used to prepare printed lists, to con- 
trol the selection of fuller documentary records or, in 
the case of film, to project and photocopy the original 
information. 

Another electromechanical device in current use em- 
ploys continuous reels of punched paper tape as input 
and still another uses punched card input and paper 
tape for intermediate storage. In the former case, the 
search requirements are established by patchboard wir- 
ing and in the latter, are established by machine option 
switches. In both devices, the successfully retrieved 
items are typed out by an attached, machine-controlled 
typewriter. 

From the experience thus far gathered in the opera- 
tion of these machines a number of generalizations may 
be made. 

1) While there is a wide gamut of card-processing 
speeds, their general performance has been quite 
satisfactory. All operate on the site of the in- 
formation processing activities rather than in a 
computation center. 

2) The chief limitations have been in treating, a) 
problems where quantification is important 
(range of boiling points, per cent of each of sev- 
eral ingredients required), b) cases involving the 
conjunction of a number of disjunctive classes 
(e.g., a card is to be scanned to determine whether 
it contains both the code for three American 
states and the code for no French province), and 
c) problems involving detailed interrelationships 
such as sequence, connectivity, subject-object re- 
lationship. These limitations apply to some de- 
gree to each of the electromechanical devices but 
in some cases these limitations have been sur- 
mounted in ingenious fashion. 

3) The cost of these machines, with the exception 
of the more powerful optical film and magnetic 
card devices, has been remarkably low compared 
to the high-speed electronic machines and their 
full-time use for information retrieval has been 
practical. The cost per search, as well as the 
capital investment, has been low. 

4) The rate of search has been limited by card and 
paper tape handling speed. Again, excluding the 
optical and magnetic, the systems have ranged 
from 8-500 items per minute. 

5) The separable unit record has proved to have a 
number of advantages and disadvantages. Among 
the former are the ability to select small subfiles 
for machine feed, the ease of file maintenance, and 
the easier access (in many cases) to original in- 
formation (through film inserts, card drawings, 
etc.). The disadvantages center around the vari- 
ability in information content of documents which 
cause either inefficient waste space on punched 
cards or introduce the difficulties of multiple 
card evaluation and manipulation. It has of 
course been possible to simulate the behavior of 
simple relay and medium electronic computers. 
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6) Experience with optical film and magnetic cards 
has been very limited. The specifications and 
demonstrations have been most impressive. 

Special Purpose Information Retrieval 
Electronic Computers 

We are now entering the era of the special purpose 
high-speed electronic information retrieval computer. 
Only one or two published accounts of such machines 
have appeared, since they are still developmental and 
frequently related to military applications. 

Such devices differ little from some currently an- 
nounced data processing equipment and it appears 
that their usage will be justified only in the largest in- 
formation processing establishments. These machines 
will look very much like a general purpose computer 
but trade unnecessary features ( e.g ., floating point 
arithmetic) for hardware embodiments more needed 
for information retrieval (e.g., a large number of inde- 
pendently operating comparison registers). 

Another “special” unit is a special limited version of 
a standard data processor with certain features elim- 
inated. The belief that this will materially reduce the 
cost of such a machine is not in accord with the experi- 
ence of computing machine manufacturers. 

Special computers are also required to handle special 
problems in information retrieval. One that is receiving 
a good deal of attention is the problem of the storage 
and retrieval of graphical, geometrical, and topological 
configurations. One special computer has already been 
designed for work in this area. Special military informa- 
tion retrieval problems have given rise to the design of 
special machines geared to these problems. 

The following is a description of a nonexistent but 
typical large, special purpose computer. The external 
storage might consist of a large bank of high-density, 
high-speed magnetic tape units which would have the 
facility (under computer control) to advance or back 
up rapidly and then to search not for one “key” but for 
many logical combinations of keys. All tape units would 
be active simultaneously under independent control of 
a special tape manipulation and testing unit. As in- 
formation which satisfies the rough screening condi- 
tions is brought from the tapes, it, would be sent into a 
magnetic core unit where a further refining process is 
carried out. Information which successfully meets all 
the criteria for one of the many simultaneous searches 
to be conducted would be sent to the proper output area 
for both immediate visual display and for subsequent 
complete printing of full information on high-speed 
equipment. 

The hypothetical computer just described represents 
only one direction in the development required. We 
assume that any good information retrieval system is 
sufficiently flexible to meet the logical requirements of 
the searching system and to search at sufficiently low 
unit cost. Beyond these, the two most critical factors 
are investment cost and the size of the document col- 


lection that can be manipulated in some reasonable 
time. The typical relay card-selecting device represents 
relatively low investment but with correspondingly small 
collections manipulable in practical times. The typical 
search program running on available electronic com- 
puters represents a high investment cost with moderate 
sized collections searchable in practical times. The 
special purpose electronic computer, such as the hypo- 
thetical one described above, represents an attempt to 
obtain increased performance at an increased invest- 
ment cost. Thus, we see that no major breakthroughs 
have occurred to date. 

The challenge to the computer designer and to the 
system designer is the development of techniques for 
handling large collections on relatively inexpensive de- 
vices. It is hoped that the developments in computer 
components, computer logic, storage devices, and sys- 
tems operation will lead to the development of im- 
proved devices. Hand-in-hand with such development 
must be the maturing of our understanding of the the- 
oretical and practical bases for the retrieval of informa- 
tion. 
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A Specialized Library Index Search Computer 

B. KESSELf and A. DeLUCTA; 


Introduction 

T HE NEED for mechanizing library information 
searches has become apparent during the past two 
decades. The phenomenal increase in the size and 
number of library establishments in conjunction with 
the requirement for greater speed in servicing informa- 
tion requests have been key contributory factors in fo- 
cusing attention on this situation. A large variety of 
developmental and commercially available devices have 
been used for putting library card catalogs into forms 
more amenable to automatic searching. It was only a 
matter of time before many researchers became aware 
of the advantages of digital computers in mechanizing 
this operation. Government and commercial organiza- 
tions, by writing new programs, were able to adapt 
those computers which were already available to them. 

Early in 1958, a program was initiated by the Rome 
Air Development Center for the design and fabrication 
of an Index Searcher that was to be tailored specifically 
to the needs of library documentation. The Index 
Searcher was to contain only those logical functions that 
would be of value in library information searching. The 
Searcher was to be developed primarily as a research 
vehicle for use in studying various index and informa- 
tion retrieval approaches. The basic design was to in- 
clude facilities to allow the Searcher to be used as a 
fully operational device. In April, 1958 a contract was 
awarded to the Computer Control Company with de- 
livery of the Index Searcher to occur early in 1959. 

Design concepts of the Index Searcher have evolved 
in part from a knowledge of the Minicard Selector. 1 
However, the Index Searcher will be used in library 
situations where index data and graphics material are 
stored separately, rather than together as on Minicards. 

The Index Searcher will search through large volumes 
of index data serially and print out the identification of 
those documents, reports, or graphic materials that 
satisfy the requirements of the search criteria. A con- 
sideration of the problems of library mechanization indi- 
cated that the following features should be included in 
the Searcher design: 1) high-speed searching of the index 
data; 2) the ability to reproduce all or part of the index 
library cheaply and quickly; 3) a minimum of delay in 
effecting the search beyond the setting up of the search 
criteria; 4) the capability for handling a wide variety 
of index and classification schemes; 5) ease of opera- 
tion; 6) flexibility in permitting frequent updating of 

f Computer Control Co., Inc., Framingham, Mass, 
t Rome Air Dev. Center, Griffiss AFB, Rome, N. Y. 

1 J. W. Kuipers, A. W. Tyler, and W. L. Myers, “A Minicard sys- 
tem for documentary information,” American Documentation, vol. 
8, pp. 246-268; 1957. 


the file; 7) search for more than one question at a time; 
and 8) a growth potential allowing for relatively efficient 
use of the Searcher either singly or in groups as the size 
of the library increased. The system design of the 
Searcher as a special purpose system, was to result in an 
information handling capability that could be matched 
in the general purpose computer field only by a consid- 
erably larger and more expensive machine. 

General Description 

The Index Searcher uses magnetic tape as the storage 
medium for index data. An index entry is made on the 
tape for each document, report, or other piece of phys- 
ical material which is to be made available for rapid 
automatic searching. An entry might consist of any- 
thing from a title to a complete text, depending upon 
the storage and recovery system to be used. In typical 
applications the entry made for a given document will 
consist of a document number, title, author, date, and 
several descriptors that define the subject matter of the 
document. 

The descriptors can be grouped into sets that are 
designated as phrases. The value of this feature can be 
illustrated by considering the indexing of a technical 
paper that describes a machine using transistorized log- 
ical circuits and a magnetic core shift register storage. 
Listing of just the four descriptors for transistorized, 
logic, magnetic cores, and shift registers could result in 
the false selection of this document during a search for 
magnetic core logical circuits. Use of phrase boundaries 
in the proper places assures that the descriptors will be 
properly associated with each other during the searches. 

The standard machine word for the Searcher is 42 bits 
in length. This consists of seven alpha numeric charac- 
ters of 6 bits each. The system also includes provision 
for handling double and triple length words so that it 
can accommodate clear text as well as coded index data. 

A typical document index entry might consist of 
twenty machine words, making a total of 840 bits of 
information. 

Search criteria are specified in terms of question 
words plus logical connectives to group the question 
words into question phrases and to group the question 
phrases into complete questions. The question words are 
stored in the internal memory of the Searcher. The 
memory has a capacity of 20 machine words. 

The type of comparisons to be made between question 
words and the document index entry words is specified 
individually for each word by plugboard wiring. The 
specification can be for “equality,” “less-than,” “great- 
er- than,” or any combination of two of these types of 
comparison. For example, a question might specify that 
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a document is required on a particular subject that 
would be identified by descriptor equality comparisons ; 
published after 1956, identified by a “greater-than” 
comparison ; and with a security classification no higher 
than confidential, identified by an “equal-to or less- 
than” comparison. • 

The question words are grouped into question phrases 
by means of plugboard-connected logical circuits. Fif- 
teen phrase elements are available for composing up to 
15 different question phrases. Two or three phrase ele- 
ments can be cascaded to make a larger phrase than 
can be handled in one logical element. Each phrase can 
use any desired combination of question words, in 
either assertion or negation form, as inputs, and each 
question word can be used in as many of the 15 differ- 
ent phrases as desired. 

Complete search criteria questions are made up of 
question phrases by plugboard-connected select logical 
elements. The same flexibility exists here as in combin- 
ing words into phrases. Ten question elements are pro- 
vided, resulting in the ability of the Searcher to simul- 
taneously search for documents meeting ten different 
search criteria. 

Searching consists of scanning through the complete 
document index tape and comparing the contents of 
each index entry with the question words and logic 
stored in the Searcher memory and plugboard connec- 
tions. 

The result of a successful search is a print-out of the 
document numbers of those document index entries that 
have met the search criteria. An identifying number 
printed beside each document number shows which of 
the several search questions is answered by that docu- 
ment. A picture of the Index Searcher is in Fig. 1. 

Functions 

The Index Searcher has six different modes of opera- 
tion. Listed in the order in which they would be used, 
these are 1) Document Insert, 2) Regenerate Tape, 
3) Question Insert, 4) Search, 5) Print, and 6) Edit. 
Each of these will be described after a brief reference to 
the system block diagram shown in Fig. 2. 

The major blocks making up the Searcher, and their 
functions, are as follows: 

A) Magnetic Tape Unit — stores and scans document 
index data. 

B) Tape Buffer — serves as a buffer to and from the 
Magnetic Tape Unit and Flexowriter. 

C) Flexowriter — serves as a punched-paper-tape read- 
er for document and question insertion, and as 
output printer for searching. 

D) Word Input Buffer — accumulates magnetic tape 
information frames to form complete machine 
words. 

E) Print-Out Buffers — store tape data which is to be 
printed out if the document is a desired one. 

F) Word Storage Buffer — stores complete magnetic 
tape index words for comparison with question 
words. 



Fig. 1— Index searcher. 



Fig. 2 — Block diagram. 


G) Memory — stores question words. 

H) Comparison Circuits — compare tape index words 
with question words from memory. 

I) Plugboard Circuits — use results of word compar- 
isons to make complete question comparisons. 

J) Control Circuits. 

The actions of the Searcher in each of its operating 
modes follow. 

Document Insert 

This mode is used to place new document index en- 
tries into the Searcher’s magnetic tape storage. Pri- 
marily it is a punched paper tape to magnetic tape con- 
version process. New index entries are submitted to the 
machine in the form of punched paper tapes. Paper tape 
frames are accumulated in the tape buffer until the buf- 
fer is full. Those contents of the buffer which represent 
complete index entries are transferred to the magnetic 
tape as one block of tape data. Any partial entry left in 
the buffer is then completed with the next paper tape 
information to arrive, and is followed by more docu- 
ments. The process continues automatically to the end 
of the paper tape. Each magnetic tape block contains 
an integral number of complete index entries. The ac- 
tual lengths of blocks on the tape are variable. 
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Regenerate Tape 

This is a simple magnetic-tape to magnetic-tape rou- 
tine, which allows the file tape to be duplicated as in- 
surance against loss of file data through accidental dam- 
age to the tape on the Searcher. This requires an addi- 
tional magnetic tape unit that is not part of the Search- 
er as originally built, although space has been left for it 
in the racks. 

Question Insert 

This mode transfers question words from punched 
paper tape to the Searcher Memory. This is accom- 
panied by insertion of a question plugboard that speci- 
fies the nature of the comparison to be made for each 
word and the combination of the question words into 
phrases and complete questions. The plugboard con- 
nections can also specify one or two words per selected 
document to be printed out in addition to the document 
number. A simplified symbolic representation of the 
plugboard and its circuits is shown in Fig. 3. 



Fig. 3 — Simplified representation of plugboard circuits. 


Search 

This is the primary operating mode of the Searcher. 
This mode performs the scanning of stored document 
index entries in search of those that meet specified ques- 
tion criteria. The tape can move in either the forward or 
reverse direction to accomplish this search. While the 
tape is scanned, tape frames are accumulated into com- 
plete machine words, and are compared with the twenty 
stored question words. A plugboard word storage ele- 
ment remembers a successful comparison with any of 
these words until an end-of-phrase designation occurs in 
the tape data. At that time the word storage outputs 
are sensed in the phrase element logic to determine 
whether any complete question phrase criteria have 
been satisfied. If so, a phrase storage element remembers 
this fact as scanning continues through the remainder of 
the tape index entry. When the end of tape data for 
the document is reached, sensing the outputs of the 
phrase storage elements determines whether or not a 
complete question criteria has been satisfied. During 
the scanning of the document entry, the Print-Out Buf- 
fer receives automatically the document number and 
two other plugboard-specified words. When a docu- 
ment answers a question, the Tape Buffer receives the 
contents of the Print-Out Buffer and a number identify- 


ing the question answered by the document. This in- 
formation is printed out on the Flexo writer. Further 
searching ordinarily continues during print-out. How- 
ever, if a series of successive selections results in filling 
the Buffer faster than the maximum print-out rate, the 
tape automatically stops until adequate Buffer capacity 
is available for further document data. When the end 
of the recorded portion of the tape is reached, the tape 
unit automatically stops and positions itself ready for 
the next search in the opposite direction. 

Print 

This mode is used to print out entire document index 
entries rather than just the three words possible in a 
normal search. The machine operates in much the same 
manner as for normal searching until a selection is made. 
The tape must move in the forward direction. Selection 
of a tape index entry causes the tape to stop, reverse, 
reread the entire selected block into the Buffer, print 
out the complete selected entry, and then resume search. 

Edit 

This mode is used to delete unwanted index entries 
from the tape. Document entries to be deleted are speci- 
fied by document number or other normal question cri- 
teria. Operation is similar to the Print mode up to the 
point of bringing into the Buffer the block of tape infor- 
mation containing the document to be deleted. 

At this point the block is recorded in the same place 
it formerly occupied on the tape, but with blank char- 
acters in the position which had been occupied by the 
deleted entry. 

Parameters 

The Searcher scans through magnetic tape document 
index entries at an effective rate of 218,000 bits per sec- 
ond, or about 5200 machine words per second, where 
each machine word contains 7 alpha numeric characters. 
For the typical document index entry length of 20 words 
mentioned earlier this amounts to 260 documents per 
second. While searching at this rate the machine seeks 
documents satisfying up to ten independent search cri- 
teria. 

The Searcher uses 2400-foot rolls of one-inch magnetic 
tape for document index storage. One reel stores 
57,600,000 bits, which is about 68,500 twenty-word 
documents. Uninterrupted search time for a complete 
reel is approximately 4.5 minutes. 

The input-output rates of the Searcher are presently 
limited to the 10-character-per-second rate of the 
Flexowriter for document insertion, question insertion, 
and selection print-out. 

Therefore, the document insertion rate, based on 20- 
word entries, is about 4 documents per minute, and the 
selection print-out rate, when printing out 3 words per 
selection, is approximately 25 documents per minute. v ' 
A 20-word question insertion takes about 15 seconds. 
These rates can be substantially increased by use of 
high-speed paper tape reader units and high-speed 
printer or punch outputs. 
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Programmed Interpretation of Text as a Basis 
for Information-Retrieval Systems 

L. DOYLEf 


T WO conditions have made it almost inevitable 
that we have an information-retrieval project at 
the System Development Corporation (SDC). 
First, our internal documentation. It has been esti- 
mated that we acquire 10,000 documents a year both 
from internal and external sources, not including books 
and periodicals. Internal distribution of these docu- 
ments runs into millions of copies. We have not been 
able to afford to abstract and subject categorize more 
than 10 per cent of our 10,000 documents a year. With 
a good retrieval system we might be able to make many 
thousands more of our documents accessible by subject 
without increasing documentation expense. 

Secondly, one of SDC’s major activities is computer 
programming for air defense, and as a result of this we 
have charge of several large general-purpose digital 
computers. 

Thus, we have both the motives and the equipment 
to study the computerization of the handling of docu- 
mented information. And so, about six months ago a 
three-man project was created at SDC to do research 
and development in this area. 

Programmed Selection of Descriptors from Text 

Previously we had considered using, as stop-gap solu- 
tions, some of the already existing well-known informa- 
tion-retrieval formulas, such as Uniterm, or marginal 
punched cards, or peek-a-boo, or something of this na- 
ture; but we thought, “Where are we going to get the 
people to read and categorize 10,000 technical docu- 
ments a year?” We realized that we were already in a 
situation which was going to become increasingly com- 
mon as time goes on — a setting where skilled man-hours 
are harder to find than time on a large computer. 

At this time we switched to a machine-centered phi- 
losophy and began to explore the possibilities for per- 
forming the chores of documentation on digital com- 
puters and EAM equipment. The crucial difficulty in 
implementing such a philosophy is finding a way to use 
computer programs to interpret natural English text for 
the purpose of effectively subject-indexing documents, 
so that they can be retrieved precisely without any per- 
son’s having read them for purposes of categorizing, 
picking descriptors, or encoding in any manner. 

Mechanical Indexing as a Prelude 

Our thinking about the interpretation and retrieval 
of natural text has changed greatly since we started. 

f System Development Corp., Santa Monica, Calif. 


For example, one of our realizations has been that it is 
premature to set oneself up for pure machine searching 
of natural text. Machine searching is superb if you 
know exactly how to describe what you are looking for 
and if you are sure that you know how to choose from 
among many possible searching strategies. I doubt if 
anyone is yet in this comfortable position with respect 
to machine searching of text. What is needed is a search- 
ing setup which is fast and convenient, while at the 
same time allowing the human mind itself, with its 
versatility and its powers of observation, to take part 
in the search. Our solution has been to employ mechan- 
ical indexing using an artificial language, which I shall 
describe later. 

The System 

We now have an experimental abstract searching sys- 
tem at SDC which was set into motion by our retrieval 
project. Through the use of it, we hope to find out the 
basic difficulties of natural text retrieval and to come 
up with new principles of language data processing, 
some of which may be useful for purposes other than 
information retrieval. It is also a research tool, through 
which properties of language and of collections of infor- 
mation may be subjected to analysis- — for example, by 
making frequency counts. Later I shall describe an in- 
stance of the use of the system for research. 

Fig. 1 shows the process we now use to encode ab- 
stracts. At the left is keypunching of the text, which in 
itself tends to undermine the purpose of natural text 
retrieval— however, we assume that input technology 
will develop to the point that keypunching will no 
longer be a barrier. For the present, key-punching lim- 
its our input to small chunks of text. We now work only 
with abstracts; however, we can handle any fragment of 
text of about abstract size (approximately 100 words). 

As the text material arrives in computer storage (as 
6-bit Hollerith), the text-compiler program translates 
the raw text into a more condensed form suitable for 
searching. There are two stages of this condensation. 

First, selection of subject terms from the text. The 
text compiler has at its disposal a table of allowable 
subject words, and it searches text for these words. 
Secondly, all of the subject words or terms which the 
text compiler finds in an abstract are replaced by bi- 
nary numbers which have been predesignated to repre- 
sent the subject words when they are stored on tape. 
These' binary number tags take up less than a third as 
much space in storage as would the subject words 
themselves. 
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Fig. 1 — Conversion of text to condensed code for searching. 


By these means the text compiler will condense an 
abstract from about 130 storage registers of text down 
to about 5 or 6 registers of binary numbers. With this 
condensation one can represent 10,000 abstracts on 
about 5 per cent of the total length of a standard IBM 
tape reel. We now have a tape layout which contains 
the boiled-down essential information of many ab- 
stracts, and any time we wish to make reference to these 
abstracts, they can be read into computer memory at 
the rate of 500 per second. The text compiler can proc- 
ess 4 abstracts per second. 

At this point we have our choice of two courses of ac- 
tion. We can either machine search the tape for particu- 
lar combinations of subject words, or we can convert 
the entire tape to an alphabetical code, which can 
be printed out as an alphabetized format and searched 
by eye. 

Machine Searching 

If one does decide to machine search this tape, it can 
be done in a very short time. With a large digital com- 
puter, such as IBM 709 or the AN/FSQ-7, the tape- 
stored abstract plots can be searched much faster than 
they can be read in. Since both these computers can 
execute instructions while in-out operations are going 
on, it is possible to conduct 100 simultaneous logical 
product searches during the 20 seconds it would take 
to read in 10,000 abstracts. 

The Human Mind as a Searching Instrument 

However, at this point in our development we value 
searching flexibility much more than we value speed or 
efficiency. Speaking of speed, it is still difficult to dupli- 
cate mechanically the feat of a person looking up a 
number in a telephone book. Let’s consider this point. 
The Los Angeles Central Zone Directory has enough 
alphanumeric content to fill six IBM tape reels. But al- 
most anyone is capable of finding an entry in this direc- 
tory in from 10 to 30 seconds. This illustrates the power 
of familiarity with alphabetical order, which is a typical 
(and therefore unappreciated) human ability. If you 
have about seven RAMAC’s you might just be able to 
exceed this common everyday performance by humans. 

If you now imagine a whole room filled with tele- 
phone-directory-sized books, each of which contains en- 


tries confined to a small portion of the alphabet, it is 
easy to see that it would take some person less than 
twice as long to find an entry in these many books as it 
would take to find something in just one book. Of 
course, for this to be possible, all the books have to be 
stacked in alphabetical order. We now have a very 
cheap searching mechanism, which no computer can 
equal in speed for such a large volume of material. There 
is, of course, the cost of publishing all these books to be 
considered, but when such things as microfilm scanners 
are available, the possibilities for exploiting human 
familiarity with alphabetical order are something to be 
seriously considered. 

Another human skill of importance in searching is 
the ability to judge the meaning of a symbol by its con- 
text. It will take a great amount of programming and 
even more preliminary research before this skill can be 
challenged by electronic instruments. 

From the standpoint of research and development, 
the most important thing about humans as searching in- 
struments is that they are observant. They repeatedly 
notice things that they are not programmed to notice. 
At this stage, our progress strongly depends on the ap- 
plication of these powers of observation. Our intent is to 
use astute people to search small document collections 
in order to find out how computers should be pro- 
grammed to search large collections. Our hope is that 
the state of the art of programming to produce highly 
condensed, information-rich formats for search by the 
human eye will improve at such a rate that pure ma- 
chine searching of natural text may not for many 
years catch up in convenience or effectiveness with 
computer-assisted eye searching. 

Our present alternative to machine searching, which 
is illustrated in Fig. 1, is the generation of an alpha- 
betized printout consisting of two letter code words for 
subjects. So, instead of using the binary subject num- 
ber tape as input to a searching program, we feed it to 
another program which converts all the binary num- 
bers to two letter symbols, which we call bigrams. The 
conversion process is totally analogous to binary-to- 
decimal conversion, the only difference being that in- 
stead of subtracting powers of ten, we subtract powers 
of 26, the number of letters in the alphabet. These bi- 
grams are punched out on regular IBM cards, after 
which EAM equipment can offset reproduce and alpha- 
betically sort the cards prior to the printing out of a 
format. 

Why do we use bigrams for our alphabetized print- 
out instead of octal numbers, decimal numbers, or for 
that matter the original subject words themselves? One 
answer is that we get greater condensation. We can 
store all the subject words in one abstract on one IBM 
card. And we are now in a position to use EAM repro- 
ducing and sorting equipment to alphabetize on every 
subject word in every abstract. Fig. 2 shows how the 
reproducing is done on the contents of one abstract. One 
can see from this that if one has an abstract containing 
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( ONE CARD PER\ 

^SUBJECT TERMy 

Fig. 2 — Cyclic permutation of code symbols to 
produce index entries. 


n subject words, then this EAM reproducing process 
will yield n cards, each beginning with a different sub- 
ject word, and each constituting an entry in an index. 
Each entry carries, along with the indexing word itself, 
the remaining subject words in the abstract, which act 
as context for the indexing word. 

Whenever I start explaining bigrams to people, their 
reactions convince me of the undying popularity of in- 
dexes which are in plain English. Nevertheless, I main- 
tain that no spoken or written language was designed 
with alphabetized indexes in mind, and am convinced 
that special indexing languages, understood and used 
by librarians and other specially trained people, will 
play an important part in information retrieval as soon 
as the programs are available to translate from English 
to these languages. 

A Better Artificial Language 

Our present system at SDC is an adaptation from 
something that started out to be a pure machine search- 
ing system, and therefore it has some of the vestigial 
organs and imperfections of an evolved creature. One of 
these imperfections is that the bigrams are arbitrarily 
assigned. As long as we feel the necessity to use an arti- 
ficial language, we might as well try to derive one which 
is tailor-made for indexing. What properties should 
such a language have? 

1) All words should be short and uniform in length. 
This is the only property of the four I am going to dis- 
cuss which our present bigram code actually has. Short- 
ness and uniformity are desirable not only because of 
convenience in manipulating the language on EAM 
equipment, but also because more information is 
brought onto one page, which aids eye scanning. 

2) No synonyms. The importance of not having 
synonyms is that it makes it possible for most searches 
to be satisfied by proceeding to one page or region of the 
index. Skipping around will be necessary only when one 
wishes to follow association trails to related topics. 

3) Relationship between meaning and alphabetical 
order. This, I think, would be one of the handiest fea- 


tures of any artificial language especially designed for 
indexing. It will greatly simplify the searching, it will 
make the artificial language easier to learn, and it will 
allow generic searching, the lack of which is probably 
one of the major disadvantages of natural text retrieval 
based too closely on the natural language itself. 

4) Quantitative dependence of assignment of English 
meanings on the contents of the library. A document is 
retrieved from a library, but even so it is not obvious 
that the contents of the library should be as important 
a factor as the contents of the document itself in affect- 
ing the way the document is encoded as a search item. 
In our artificial language, the nature of its correspond- 
ence to English words ought to be governed by how 
much of what is in the library. Application of this prin- 
ciple is difficult because it must certainly involve a 
means of frequency-counting everything in the library. 

But the benefits are many. One possible benefit is that 
the state will be approached where words of the artificial 
language will be equally used, which of course is an in- 
formation theory ideal. It is important to apply this 
ideal if one is to get predictable effectiveness from a 
searching system. The equal use of artificial words tends 
to lead to equal retrieval precision for all documents 
and also has the effect of insuring that all parts of an in- 
dex will be equally used. I have seen several automat- 
ically generated word indexes, and they all suffer from 
the presence of very large blocks of entries beginning 
with the same word. Their heterogeneous structure 
causes about half of the space to be taken up with 
blocks which are seldom used because they are so large. 
There are a great many arguments in favor of isotropic 
indexes, where entry blocks are similar in size regardless 
of the subject. 


Library Analysis 

One of the general aims of our project is to develop 
means of analysis of very large collections of verbal in- 
formation. We think of a collection of documents as 
having an inherent structure, which is not affected by 
physical rearrangement of the documents, a structure 
which, incidentally, we are fully able to probe only with 
the aid of fast computers. Fig. 3 shows how significant 
the structure of a document collection can be. 

One question in my mind has been : Is it possible for 
a computer program to determine what is a subject 
word without having available any subject- word table 
or dictionary prepared by some human? In other words, 
do subject words have distribution characteristics with- 
in a library that a computer program can detect, there- 
by permitting distinction from nonsubject words? The 
data in this figure indicate that the answer is very prob- 
ably “yes.” 

The dictionary which we now use as input to our text- 
compiler program contains no nonsubject words, but 
it does contain some borderline cases like the word 
“time,” which many people would regard as too general 
a concept to be a good subject word. “ADC” on the 
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FREQUENCY OF 
OCCURRENCE 
IN 600 

abstracts: 

ADC = 40 
TIME = 43 


Fig. 3 — Probability of highest observed correlations 
as a test for subject words. 


other hand is a good solid subject word, standing for 
“Air Defense Command.” Our alphabetized index en- 
ables us easily to select the words which correlate most 
highly with these two topics, and we can calculate and 
compare probabilities. 

One can reason that a good subject word should have 
certain other words which co-exist with it in the same 
documents with a frequency much greater than expect- 
able from chance distribution, but that nonsubject 
words, which are likely to be used by anybody writing 
about any subject, should not have high correlations. 

To test out this notion I calculated the probabilities 
for the highest observed correlations occurring by 


chance, both for “ADC” and for “time,” given that all 
words in the library are randomly assigned to docu- 
ments. Of course, the words are not randomly assigned, 
so I got some very low probabilities. As Fig. 3 shows, 
the ADC correlations are very much more improbable 
than those for “time.” 

Now these correlations will become weaker for any 
word, subject word or otherwise, as they are present in 
fewer documents. This means that in order to apply any 
sort of correlation test to select subject words, one has 
to make allowance for the frequency of the word. Un- 
fortunately, the correlation test will fail altogether when 
a word is present in only 3 or 4 documents. One has to 
have a large enough sample. Also author biases in lise 
of common words could conceivably cause many non- 
subject words to pass a correlation test. But, for- 
tunately, as collections increase in size these effects 
should become less important, and it is in very large 
collections where this sort of methodology will be 
needed. 

Conclusion 

As a final comment: libraries and other collections of 
written information can be thought of as realms of na- 
ture, subject to scientific observation. Science brings 
valid simplicity to that which is apparently compli- 
cated, and it is hard to find anything more complicated 
than masses of ideas recorded on paper. And so I make 
explicit an idea which I hope has been implicit in this 
presentation — that general-purpose computers of today 
give us the opportunity to apply scientific method to 
uncover the principles of the nature and use of informa- 
tion in order that we may put to better use the vastly 
more powerful computers of tomorrow. 


A Theory of Information Retrieval 

CLINTON M. WALKERf 


T HE mathematical formula which best describes 
my conclusions from reading the literature on in- 
formation retrieval (IR) is the following: 

4 U OK 4 $ = ET 

For you, better for dollar equality. 

This states that a more economical approach for or- 
ganizations interested in information retrieval might be 
collectively to support as an information retrieval center 
some nonprofit organization, such as SRI or SDC. Such 

f Hughes Aircraft Co., Culver City, Calif. 


an organization could be a center for receiving and dis- 
semination of up-to-the-minute retrieval literature of 
organizations concerned; could advise on the practica- 
bility of certain undertakings; and could perform ex- 
periments in the field of IR. 

Aside from this one equation, formulation should 
proceed from basic principles. Perhaps the most basic 
of all principles is that meaning, rather than informa- 
tion alone, needs to be retrieved. Just how does one 
produce or obtain meaning? Take the example of a 
small child. All a child knows at first is himself. He gets 
acquainted with his hands and feet, and then with his 
near associates by relating them to himself. He gradu- 
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ally learns to classify things in terms of roundness, 
which things he might call a ball; in terms of use, such 
as food. New things learned are related to things al- 
ready known. For example, at an early age, any man 
might be classified as “daddy.” Throughout his life, 
meaning is obtained by relating what is familiar to that 
which is unfamiliar. 

We might christen this process the “Mew-Mew” 
theory of meaning. Each of us, as a “me” looks at some- 
thing else as a “you,” which we interpret in terms of the 
“me” or what is known, but which we might also take 
back into the relative “you” for objective evaluation. 

This process of classification is an operational way to 
produce meaning. A language, in effect, classifies nouns; 
descriptions of “which,” “what kind of,” and “how 
many” have meaning when related to other nouns. What 
the nouns do — and how, when, and where they do it — 
has meaning when related to what other nouns might 
be doing. 

So, an operational language is one in which classifica- 
tion takes place in familiar areas or domains. In these 
domains, dictionaries can be constructed of key nouns; 
definitions can include relationships to other key nouns 
within the area. For retrieval purposes, reference to a 
key noun could have a built-in potential reference to 
other key nouns, thus providing a built-in meaning 
potential. The prospects are exciting. But, before we de- 
velop the idea further, let us lay down some basic postu- 
lates. 

We can set up a number series as a set R of objects 
called nouns, with the relationships defined by three 
operations denoted by S, II, and /. Concomitant with 
this set is another set M whose members can be derived 
from certain operations on the set R. The symbol 
— > means “results in a relationship of,” or “implies that” ; 
the symbol “ + ” means “and”; the symbol “ — ” means 
“not”; “( )” are used in the usual enclosure sense. An 
IR specific operation is one denoted by the symbols 2, 
n, or /. An IR nonspecific operation is any other opera- 
tion in real or complex variable theory. We will assume 
that IR nonspecific operations will follow the manipu- 
lative rules of real and complex numbers for IR specific 
operations. For example, the operations are additively 
commutative. 

(Aj B) + (CJ D) = ( CfD ) + (AfB) 

(ASB) + (CSD) = (CSD) + (ASB) 

( AILB ) + (CIIP) = (CIIP) + (ATIB), 

The operations within the parentheses are IR specific; 
those between the parentheses are IR nonspecific. Addi- 
tional postulates are required for defining the operation 
processes in an uncompleted operation. The following 
postulates and definitions are offered for consideration. 

Definition 

The domain of A consists of all subcategories and sub- 
sequent subcategories under A. 


A fB states that a word, A, which is classified in cate- 
gory B is put in a relationship such that A is in a hier- 
archy less than that of B, and that the domain of A in- 
cludes not more than the domain of B. That is, A is part 
of B. 

Postulate 1 

a/bJc — » AfC 

states that a member of a subcategory is also a member 
of a category; for example, shoelace is a subcategory of 
shoe, which is a subcategory of clothing, which implies 
that shoelace is also a subcategory of clothing. 

Postulate 2 

AfB -► BfA 

means that a category cannot be a member of a subcat- 
egory unless it is the only member. True, in ordinary 
language, sight can be thought of as a subcategory of 
sensing and perhaps sensing at the same time can be 
thought of as a subcategory of sight; but, for the con- 
venience of constructing an unambiguous dictionary, we 
can exclude this possibility until such time as we find it 
absolutely required. Thus, we shall construct a dictionary 
in a domain with rigid hierarchical relationships among 
nouns. If later, we want to relax this requirement, we 
may find some interesting experiments available in the 
realm of “machine thought processes.” 

Definition 

AhB means that A is synonymous with B. 

Postulate 3 

(AfC) + (ASB) — BfC 

means that synonyms within an area are necessarily 
members of the same category. 

Definition 

AttB means that A and B are related by means of the 
characteristics of some domain. We shall call these 
words “relatives.” 

Postulate 4 

(AUB) + (AfC) BfC 

means that relatives are subcategories of the same cate- 
gory. 

Definition 

M is a set of elements of meaning derived by cate- 
gorizing two or more elements of R at the same time. 

Postulate 5 

(BSC) + A f(B + C) — » (AfB) + M = (A fC) + M 

means if B and C are synonymous, and A is a common 
subdivision of both of them, then the classification of A 
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into B and A into C simultaneously adds meaning to 
one of them, but it would be redundant to use both 
classifications. 

Postulate 6 

(AUB) + (A + B)fC = (AfC) + ( BfC ) + M 

means that when relatives A and B are, together, classi- 
fied as members of C, they contain an element of mean- 
ing which is not present when they are separately so 
classified. 

Postulate 7 

(£IIC) + Af(B + C) -» AfB + AfC + M 

means that to categorize a subdivision of two relatives, 
B and C, is to add meaning to both of them. 

We have, by these postulated operations, created a 
language of classification — an operational linguistics 
which should be compatible with operational mathe- 
matics. Hopefully, a classification of nouns accessible by 
data processing equipment can relieve the information 
seeker of the trouble of searching the entire haystack 
for his needle of information and thread of meaning. 
Purposely sacrificed is the richness of redundant normal 
language in favor of the more important feature of ex- 
actness. Not only do we attempt to be more exact, but 
also to minimize ambiguity, to allow easy translation of 
concepts, to assure objective criteria of meaning, and to 
provide a basis of agreement in discrimination. 

Postulate 1 tells us which words can be classified in 
a given domain. Postulate 2 prevents common words 
from being counted in esoteric categories, unless they 
are subsumed under those categories. Postulate 3 per- 
mits the counting of synonymous words in the same fre- 
quency tally. Postulate 4 permits the discovery of alter- 
native paths for continued search. Postulate 5 permits 
singling out of the representative path among equivalent 
paths to be followed. Postulate 6 shows that two words 
are more significant if the context does classify them to- 
gether. Postulate 7, finally, shows that paths which 
originally diverge become significant upon reconverg- 
ence. In all those postulates which have symbol M as an 
added element, significance is increased since M repre- 
sents meaning and meaning is of prime importance in 
transference. 

In any system of information retrieval, there are 
factors of cost, speed, and power. These three criteria 
can be used to determine, under a given circumstance, 


which of several alternatives is to be preferred. In many 
instances, the major purpose of the retrieval system is 
to perform a rough scanning job for a literature searcher, 
to determine for him whether a particular document is 
worth further reading. Often the author can furnish, in 
addition to his name and topic, a list of his main ideas 
and purposes. He might even estimate a degree of cor- 
relation between the concepts embodied in his document 
and a list of key nouns in its general area. 

To apply the power criterion, the machine or human 
doing the segregation of valuable from useless informa- 
tion can be simulated by a filter separating relevant mes- 
sage from total signal. Assuming homoscedasticity and 
linearity in the specified direction, a function 

F = 2 (y — bx ) 2 

can be constructed, the parameter b minimized by least 
squares, and a Pearson correlation coefficient, r, ob- 
tained between simulated relevant message and simu- 
lated total message. The parameter, b, would represent 
the error term between, for example, time and ampli- 
tude. An autocorrelation can also be performed min- 
imizing the error between message power and an ampli- 
tude-attenuation factor representing noise. 

With power evaluated, we can set whatever bound- 
aries we desire as to speed and cost and make our 
choice by linear programming. 

An example of a low-cost, high-speed retrieval system 
with fair retrieval power is one based on the key nouns 
with which an author titles his document. Other key 
nouns are likely to be found in the same sentences as the 
title key nouns ; therefore, the searcher, machine or hu- 
man, can reduce the volume of the document to a de- 
sired degree of abstractness by selecting the frequency 
and location of the sentences containing these title 
nouns which he wishes to extract. A simple experiment 
was performed by the author, using as an abstract the 
first sentence containing a title noun in each major sub- 
division. Questions pertaining to the documents con- 
cerned were asked participants in the experiment, some 
of whom had read the author’s abstract; some, the ab- 
stract of key words; and some, the entire document. 
Results of the scoring were roughly comparable for the 
three categories, for equal reading time. The experiment 
itself is not so important except as an illustration of the 
power of the use of key words. Properly categorized, the 
use of key nouns could become an effective means of 
speedy, powerful, and, in large volume, relatively inex- 
pensive information retrieval. 
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The Role of USAF Research and Development in 
Information Retrieval and Machine Translation 


ROBERT F. 


Introduction 

ripHE United States Air Force has numerous and 
I varied types of data handling problems. This 
paper reviews some of the developmental ap- 
proaches and contributions that the Air Force has made 
toward the solution of semantic-graphic information 
handling problems. Some of the interesting problems 
encountered in development of techniques and equip- 
ment in this field are presented. 

Background History of Rome Air Development 
Center Efforts in the Field of Information 
Retrieval and Mechanical Translation 

In the past four years the Intelligence Laboratory 
of the Rome Air Development Center (RADC) has 
learned, through trying experiences, how to get the re- 
quired movement in this data handling field. Under- 
standably there are many approaches or philosophies, if 
you will, of how to develop the right synthesis of index, 
logic, hardware, etc., for any particular informational 
retrieval solution. The same can be said of mechanical 
translation (lexicon-logic and hardware). If we allow our 
minds to review these years and look at the situation as 
it was when we began our efforts, without today’s vast 
knowledge of hindsight, I believe our approach would be 
quite similar to the one we took then. We would see the 
information retrieval problem growing at a staggering 
rate. The linguistic side was getting some attention, but 
the hardware, very little. The need and requirements 
for the Air Force were there and all that remained was 
to gather funds, select approaches, and secure contracts. 
I presume you recognize the humor of the previous sen- 
tence. 

At that time the Air Force started to lend support to 
various projects already underway as well as to initiate 
entirely new work in this field. We knew the field needed 
much development effort, and involuntarily the Air 
Force took on the role of “catalyst” in information 
retrieval and mechanical translation developments. 
Note that I am not saying we were first with most; 
indeed not— we slipped into a “role” that was important 
to the Air Force and I believe it has done justice to the 
problem of both information retrieval and mechanical 
translation. You will note that I imply the existence of a 
common problem area in my use of the term “both in- 
formation retrieval and mechanical translation.” In- 
deed, with the exception of the problem of physically 
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handling documents and their contents, the Air Force 
Research and Development (R&D) program has been 
based on the premise that R&D effort in these two areas 
should be mutually cooperative. To illustrate this part 
before passing on: it gives a good return for effort ex- 
pended because the two fields are interrelated, and ad- 
vance in one usually means advance for the other. For 
example, if we were interested in information storage and 
retrieval alone, the Mechanical Translation (MT) field 
would be suffering for lack of a high-density storage that 
now seems quite practical. They “complement” one an- 
other from a development point of view, not only in 
hardware as mentioned but also, and perhaps more im- 
portantly, from the study of the rudiments of language. 

Some Contributions by RADC to the Large- 
Scale Information Retrieval Problem 

Several years ago RADC could not begin to say what 
type of development catalyst was needed. It could have 
been in the form of heat generated from “blowing off 
steam” about the “vast amount of data that must be 
handled” or it could have been in the form of a stim- 
ulating hardware development acting as a catalyst in- 
serted into all the ingredients and by “stirring around to 
bring about enough agitation to get something done in 
the field.” As mentioned in literature, the old cliche of 
bewailing the fact that we are being overwhelmed with 
vast amounts of data and consequently develop only 
half-vast ideas, was not all correct, although I remember 
using the expression more than once. We accepted the 
approach of getting “something” underway and in so 
doing we became a doer in the field as well as the cause 
of the needed catalystic actions. From the start we real- 
ized we would have to accept the empirical approach; 
by this I mean a single superior approach was lacking. 
We accepted the empirical approach not in total ignor- 
ance, for we knew if one was to develop working tools, 
theoretical analysis alone would be of little help. In our 
search for new storage media in the field of information 
retrieval, we came across a high storage density, equip- 
mental technique which, when coupled with high read- 
out rates, could well be the answer to a practical and 
economical MT look-up or dictionary device. There was 
one storage medium known at that time that had pos- 
sibilities of handling densities in the order of 10 6 bits per 
square inch ; this was the work of King and Ridenour in 
the use of photographic emulsion on glass disks. The 
work that followed is now history — the disk photoscopic 
memory, handling 3X10 6 bits/square inch, was made 
feasible, providing us with an extremely valuable empir- 
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Fig. 1. 


ical tool for further research in both information and 
retrieval and MT. This is an example to illustrate the 
point mentioned earlier, of getting better value in de- 
velopment investment when a development group has 
interrelated fields. While I am discussing the develop- 
ment of the photoscopic memory, I would also like to 
illustrate an interesting point. This is of particular in- 
terest because it illustrates a sometimes neglected point 
in developing an equipment that is dependent on a new 
technique. Referring to the photo disk memory, the 
equipment necessary to produce a disk was considerable, 
but of course necessary, if one was to get a high-density 
storage medium (see Figs. 1 and 2). 

These two pieces of equipment by themselves do not 
represent all the necessary capability required to make 
a disk, but do show quite clearly the development in- 
volved in reaching a goal of practical and economical 
storage. The point here is that development in these two 
adjacent fields does not require only development of 
data handling equipment per se. It requires development 
of all those components that have anything to do with 
the creating of the media. Actually, the development 
breakthrough here in terms of what had to be done to 
produce the required density, was not the disk itself ; al- 
though this is the end product, it was the precise com- 
ponents that allowed us to make this disk from the raw 
data on the tapes, thereby providing a facile method of 



Fig. 2. 


trying many types of stored data. This is not unusual 
in development programs of this type, but it is the un- 
heralded side. In terms of engineering toil, it represents 
70-75 per cent of the work and a substantial portion oi 
the development dollars. Feasibility is a wonderful ex- 
pression but a tricky term when it comes to develop- 
ment work. It was feasible to reduce a “bit” in terms of 
laboratory tests — the emulsion always had the resolu- 
tion — putting the emulsion on optical flat glass had 
been done — reducing the bits to concentric tracks to 
disk to show feasibility for a digital store — all this then 
is “laboratory feasibility,” and the cost is quite in- 
significant when compared to the cost to reduce many 
millions of bits in a unit of time on a disk at the ac- 
curacy required. This had to be done precisely and 
accurately, and peripheral equipment had to be de- 
signed and constructed to make the photoscopic memory 
workable. The first set has been fabricated and improve- 
ments are now underway to perfect the disk-making, 
equipment. The electronic logic used for reading in and 
out of the photoscopic memory comprise the “other 
half” of the development. 

Now as to the empirical approach in information re- 
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trieval. Although it seems that a device would exist that 
allowed compact physical storage and efficient retrieval 
for large-scale document libraries, such was not the 
case several years ago. Today after some doing instead 
of speculating, several methods exist. I shall speak of 
three RADC equipment developments that cover con- 
ceptual voids in the field of storage and retrieval. These 
developments can be broken down into two general 
categories, both of which depend on environmental and 
operational conditions when selection is made. 

Category 1 — The separation of the index from the 
text. Defined, it is the index of the document removed 
from the document itself so that the index search is 
separate from the physical document. The physical 
document is retrieved by an identification number as a 
subsequent operation. 

Category 2 — The combination of the index with the 
text. The index will also produce the physical document 
when selection is made. 

Under Category 1, at RADC we have the Magnacard 
Development and the Index Selector. Both these devel- 
opments come under the heading of technical develop- 
ment, which means in reference to these particular 
equipments that we are developing “working” tools for 
experimental use at RADC. In the case of Magnacard, 
the storage medium is magnetic material deposited on 
segmented tape, IX 3-inch plastic cards. Engineers at 
RADC feel that Magnacard has excellent potential for 
files that require high-speed extraction of information 
and also where ease of updating and extensive file manip- 
ulation by categories is required. 

The Document Data Index Set or the Index Search 
Computer for specialized library as reported in another 
paper at this conference by Ben Kessel of Computer 
Control Corporation, is an Index Searcher designed for 
library mechanization. It searches a large volume index 
data and prints out the identification of the document- 
graphic material, etc., that satisfies the search require- 
ments. The Index Searcher uses continuous magnetic 
tape as the storage medium and the scan is serial in 
fashion. 

Under Category 2, that is, index and document text 
stored together, we find the Minicard program. As one 
would suspect, this philosophy is based on usage with 
extremely large files; and, aside from its ability to per- 
form random search, it of course reduces file space and 
bulk document handling problems considerably. We at 
RADC consider this as one of the outstanding examples 
in empirical development approaches, and state with- 
out reservation that this technical accomplishment is 
unsurpassed in the storage and retrieval field. Inci- 
dentally, this development has now reached a point 
where one can say, “It works.” It is our sincere hope 
that large-scale empirical data will be obtained by its 
application that will give still further impetus to storage 
and retrieval development. Also at this point, it might 
be of interest to those in this field that achievements 
of this kind do not come easily, and I am sure designers 


and engineers in this field realize fully that 4J years is 
certainly a short period of time to develop an aggregate 
of ten complex equipments having many thousands of 
interrelated problems involving optics, emulsions, mech- 
anisms, and electronics. 

What does all this mean? It means we have mecha- 
nized library equipments that will simultaneously give 
improved operations and serve as tools by which we 
can experiment with various known library languages 
and in a relatively short time show the hidden problems 
in these index schemes themselves. It will also be quite 
natural to design the index around the logic and struc- 
ture of the tool. We can prove the worth of indexes by 
constant evaluation while building a file. 

I was asked to include' in my paper all the work being 
done by RADC in the field of information retrieval and 
MT. In this respect I would like to mention that we are 
very much involved in the field of character recognition. 
This interest at first came about through the input 
problems associated with MT and subsequently con- 
sidered for all input problems in data handling such as 
auto indexing, abstracting, etc. We have sponsored a 
development model which reads one English type font 
including numerals, both upper and lower case letters, 
space, and punctuation. We also are under way in de- 
veloping a Cyrillic character reading machine which 
will give the MT field a tremendous boost in cutting 
down the transcription cost. 

New York University has recently completed the first 
phase of a study for RADC on Russian printing matter. 
This study included such problems as the variety and 
frequency of Russian type faces and sizes in current use ; 
the reflectance data of the printed type, the reflectance 
data of the Russian paper, the absorption and reflect- 
ance data on inks used in Russian printing, the predomi- 
nant method of printing, and also the frequency of print- 
ing errors. 

RADC is also doing other work in the MT field be- 
sides developing hardware. A contract with the Uni- 
versity of Washington has brought forth a lexicon in 
the order of 500,000 words with Russian as the “source” 
language and English as the “target” language. These 
words will be used on the photo memory of the me- 
chanical translator. RADC scientists are also aiding 
others in supporting the very interesting work of Dr. 
Oettinger at Harvard in linguistic work in producing 
scientific dictionaries automatically. We are also sup- 
porting the longer range efforts of the Cambridge Lan- 
guage Research Unit of Cambridge University. This 
research centers about the use of logical methods 
utilizing the thesaurus approach in obtaining a trans- 
lation breakthrough in the multiple meaning problem. 
Here thesaurus 1 means “an organization of word usage 
in an ordering dependent on logical content (rather 

1 Report on the work of the Cambridge Language Research Unit 
for the National Science Foundation prepared by Gilbert W. King 
dated July, 1958. 



Samson: USAF Research in Information Retrieval 


69 


than on alphabetic content as in a dictionary).” These 
two efforts are supported jointly with the National 
Science Foundation. 

RADC scientists are also aiding in the support of the 
Research Group of the Center of Studies on Linguistic 
Activity and Cybernetics, University of Milan, Italy. 
This work is a continuation of the research studies per- 
formed on mental operation and semantic connections. 
The Research Group is pursuing the approach that man 
has fundamental order in his thinking process and that 
these are elements of a correlational net. Taking this 
correlational structure of thinking and mastering the 
semantic connections which link the input and output 
language within this structure, they believe, will be a 
solution to some of the more difficult problems in 
mechanical translation . 2 

We have a development that is completed and al- 
though it is classed in the field of information dissemina- 
tion, we mention it here because it is used in association 
with storage and retrieval devices. We feel that dissemi- 
nation exists as an important problem in the continuous 
flow of data in the field of data handling. This function 
can be automatized; the equipment referred to is the 
automatic disseminator jointly developed by RADC 
engineers and Magnavox Research Laboratory. The 
disseminator determines what groups are qualified to 
receive a given document and controls the production 
and addressing of copies so as to insure that the qualified 
groups get their copies quickly. The disseminator must 
determine on the basis of the subject and geographical 
area of coverage of a given document who is qualified 
to receive a copy of that document. The disseminator in- 
put, as used in one case by RADC, is the flexowriter 
tape that was used in the Minicard camera for control 
and code input. The information on the tape is com- 
pared to the stored requests in the disseminator as 
stored in a magnetic drum. The output is tape that con- 
tains control data for manufacturing duplicate Mini- 
cards based on a match in the disseminator. 

2 S. Ceccato, “Mechanical translation,” Automaz. e Automat., 
p. 1, April, 1958. 


Some Requirements of the Future 

After this cursory review (and I hope some insight) in- 
to information retrieval and mechical translation devel- 
opment efforts of the RADC, we come to a question of 
what lies ahead in these two fields. Before I go too far in 
this direction, I would like to mention that the Air Force 
has a cardinal interest in the national problem concern- 
ing technical information. As can be seen by our efforts, 
we are going through a “development era” which we feel 
will have a great influence on the national technical in- 
formation picture. This is a natural feeling to come from 
a group that is engaged in developing techniques and 
hardware such as language research, print readers, 
automatic language translators, storage and retrieval 
devices, and disseminators. Equipment such as this 
will, out of necessity, play an important part in the na- 
tional picture in both centralized information systems 
efforts or in decentralized efforts. 

Being in the development field, one supposes we 
should have fine prediction qualities in the semantic- 
graphic data handling field. Frankly it boils down to 
studying the trends, following the curves and coming 
out with the statement that future equipment in these 
fields should have -faster scanning rates, higher excess 
speeds, greater packing power, lower power require- 
ments, lower cost, etc. However, anyone can make those 
predictions, but in speaking for a group which has a real 
invested interest in these fields, we feel the empirical 
exploitation of developed equipments should be aggres- 
sively pursued and that much more should be done in 
language research for both information retrieval and 
MT. We feel some effort is “coming about” in this field 
but many more “bold steps” must be undertaken. 
Mechanical translation by itself is a language problem, 
and, by its solution and future use, we only add more 
literature in the already heavily loaded field of storage 
and retrieval. Being engineers and scientists we tend 
perhaps as a group to shy away from the language re- 
search side of information retrieval and MT. However, 
we have slowly learned over the past few years that 
herein lies the ultimate solution to our immediate com- 
mon problem. 
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Computing Educated Guesses 

E. S. SPIEGELTHALf 


T O DISTINGUISH himself from the poor be- 
nighted man-in-the-street, the computer sophisti- 
cate is apt to refer to the beasts as “so-called giant 
brains” or as “lightning-fast idiots.” He knows, as do 
we, the great gulf which separates the human brain from 
the general-purpose digital computer. Still, the exact 
dimensions of that gulf are quite unknown, and the 
desire to show that the hiatus between man and ma- 
chine is smaller than many suspect impels both the ad- 
venturesome and the iconoclastic. The attempt, the 
successful attempt, to automate one area hitherto con- 
sidered an exclusively human domain constitutes my 
topic today. 

We are all familiar, if only by hearsay, with the 
troubles that can beset the best of computer programs 
if the input to the program is not thoroughly debugged. 
For a program designed to test the putative behavior of, 
say, a proposed steam turbine, where the input consists 
of a scant dozen or so parameters, input debugging is 
hardly a problem. The situation is quite different for a 
data-processing operation, particularly when the in- 
put is massive, as it usually is. Three alternatives, all un- 
pleasant, present themselves to the supervisor of such 
a large-scale data-processing operation. He can build a 
wide variety of error-detecting features into his pro- 
gram, flagging all input errors for subsequent human 
correction, he can employ a host of human pre-editors to 
clean up the input, or he can hope that input errors are 
rare, and let it go at that. 

Unhappily, there are many applications where errors 
are not rare, where the do-nothing solution is obviously 
frivolous and where, consequently, a sizeable group of 
humans is necessary, either as pre-editors or as on-line 
trouble-shooters. Nor is it always the case that the 
necessary human beings can be clerical types. Certain 
input debugging calls for sophisticated and knowledge- 
able practitioners. We are all hopeful — almost all, any- 
way — that keypunch machines and operators will sooner 
or later be superseded by character-reading devices and 
the like. There is no philosophical difficulty in conceiv- 
ing of typed, printed, or handwritten characters being 
translated directly into computer language without any 
human intervention, provided, of course, that those 
characters were correctly typed, printed, or written to 
begin with. Suppose, however, that the source char- 
acters are incorrect. Consider the ingenuity expended in 
the Post Office just in recognizing all the variations of 
“Albuquerque.” Our Russian colleagues are supposed to 
be far advanced in the domains of automatic translation 
and character-reading, but present their machines with 

f General Electric Co., Bethesda, Md. 


a first edition of “Cybernetics,” with all its typograph- 
ical errors, and horrible difficulties would ensue. Our 
choice, then, is clear. Either we admit that many im- 
portant data-processing applications are impossible to 
automate completely, or we find a way to mechanize the 
human capacity for making educated guesses. We be- 
lieve that, for some applications at least, we have found 
a way. 

While the techniques we have developed were con- 
ceived with one particular application in mind, I shall 
describe them without reference to that application, 
successful as it was. The principal reason for taking this 
tack is to be able to present the basic, quite general, 
features of our method without being tripped up by the 
special form-fitting required by the actual problem. So, 
let us be general, and consider any language with which 
humans attempt to communicate with one another. 
These may be natural languages, like English or Ger- 
man, or artificial languages like Esperanto or certain 
telegraphic codes. There are all sorts of personal rea- 
sons for communication being difficult — ignorance, 
dogmatism, poor sentence structure, etc.; however, 
even if these factors did not exist, all sorts of nonhu- 
man noise would beset would-be communicators. In- 
formation theory makes much of “redundancy” as an 
aid in error-detecting and error-correcting when a noisy 
channel is being used. Indeed, even humans who have 
never heard of information theory make continual, and 
skillful, use of redundancy in unscrambling all sorts of 
garbled communications, whether the trouble be cross- 
talk in a telephone conversation or missing letters in a 
crossword puzzle. Without attempting to build a model 
of the brain, replete with neural nets and such, let us see 
if we can single out the functions performed by human 
redundancy-exploiters. If these functions turn out to be 
performable without recourse to extrasensory percep- 
tion or to the psychokinetic effect, our automation 
problem is essentially solved. There remain only the 
minor problems of collecting all the necessary data, 
carrying out a rather gruesome programming task and 
finding a computer fast enough and capacious enough to 
make our solution practicable. I shall return later to 
this question of practicability. At the moment, allow 
me to sketch the functions which, when suitably pro- 
grammed, allow a general-purpose computer to simulate 
a redundancy-exploiting, error-detecting, and error- 
correcting human being. 

Rather than jump into a completely general and ab- 
stract formulation, let me use a concrete illustration. 
Fig. 1 shows two familiar sights, a correctly prepared 
mailing envelope and, below it, a somewhat sloppier ver- 
sion of the same thing. We shall assume at first that a 
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John I. Zilch 

40 Blueberry Lane 

Boston 3, Mass 

Computers Ltd. 

12345 E. 152nd St. 

Phoenix, Ariz. 


Att. : Mr. P. B. M. Smith 


(a) 

JohnlZilch 

40 Blueberry Line 

Bost. Massachuesets 

Computers Lid. 

112345 E 152 

Pheonlx.A. 

Attention Smith 


(b) 


Fig. 1 — Envelopes, (a) good, (b) bad. 


“perfect” character-reading device has “read” the per- 
fect envelope, and consider the functions which must be 
performed by a machine to “understand” the envelope, 
i.e., to route it to the correct addressee by the desired 
means, e.g., air mail or first class. We shall then consider 
a fallible character-reading device reading the lower, 
garbled, envelope, and see what can be done there. It 
should be emphasized that, in this application, we are 
not concerned with the essentially straightforward task 
of actually routing the envelope to its destination. Our 
job here is just to ascertain the information needed by 
the routing program. 

Our machine must perform two separate functions on 
each “word” read from the envelope. A word here is any 
group of contiguous characters on one horizontal line, 
not containing any embedded blanks or commas. Given 
any such word, the machine must first ascertain the 
class of words represented by this word. In our example, 
the machine must determine that “Phoenix” is the 
addressee’s city, and that “I.” is an initial of the sender. 
This first function is called the “identification” of the 
word. The second function is that of “recognition.” 
Having established the class to which a word belongs, it 
is next necessary to determine which one of the class 
members the given word represents. In our first example, 
the recognition process is simple, almost trivial. 
“Phoenix” is matched against every element in a master 
list of cities and, lo and behold, it is found that “Phoe- 
nix” is “Phoenix.” A glance at the lower envelope on 
Fig. 1 will reassure you that the recognition problem is 
not always a trivial one. 

The identification process is fairly easy for a Gestalt- 
perceiving, pattern-recognizing human who is himself 
accustomed to writing envelopes according to the stand- 
ard format. The machine needs a little help in this direc- 
tion. Fortunately, we can provide this help. On the one 
hand, we can program our machine to elicit the same 
data that our pattern-recognizing facility allows us to 
obtain. Clearly, our character-reader will be able to 
note, for each word, its relative position with respect to 
all other words on the envelope, and its position with 
respect to the envelope itself. For each word, then, we 


start off with the knowledge of the line it is on, its posi- 
tion on the line (left end, right end, interior) and the 
words which flank it on either side. With a little extra 
programming effort we can determine the length, i.e., 
the number of characters of each word, its character 
pattern (is it all alphabetic, all numeric, some sort of 
hybrid?) and, perhaps, the presence in the word of some 
salient feature, e.g., the colon following “ATT.:”. In- 
deed, we can usually determine quite easily much more 
information than we need for the identification of our 
words. Much more, that is to say, when we are dealing 
with a noiseless channel, and/or a communication for- 
mat as simple and relatively invariable as the front of 
an envelope. 

Of course, whether this information is adequate, 
overly complete, or inadequate depends on how we use 
it. At this point in the identification process, the machine 
must turn to its accumulated store of factual knowledge, 
a store which is compiled by a subsidiary program in 
advance of production running. This store consists of 
lists and tables of probabilities, and provides the data 
which, in conjunction with the specific information for 
each envelope, allow each word to be identified with a 
high probability of correctness. Our basic technique 
here is the use of Bayes Factors as instruments for 
weighing evidence. Fig. 2 gives the essentials of this 
technique. 

For each class of words that can occur in the specific 
type of communication in question — mail envelopes, in 
our example — an a priori probability is given for the 
occurrence of a representative (or two, or n) of that 
class. This probability, like all the others we use in this 
process, is derived from frequency counts on sufficiently 
large samples of the data to be processed. Also for each 
class, we provide the probabilities that, for example, a 
specific representative of that class will have length 
3, or 4, or 5, or that the class representative will be 
found at the beginning, or the end, of a line. In brief, 
for every piece of information we scan each envelope 
for, we have a corresponding set of probability distribu- 
tions, one set for each class of expected words. 

In the identification phase of our program, we con- 
sider one actual word at a time, testing that word 
against the hypotheses that it is a representative of ex- 
pected class A, B, etc. Eq. (1) in Fig. 2 gives the skeleton 
of such a test. Here we are testing the hypothesis that 
the word “Smith” is a representative of the “zone- 
number” class. Our frequency counting is supposed to 
have informed us that the a priori probability that any 
word on our envelope is in the zone-number class is 
0.017. We first test our hypothesis by using the empiri- 
cally-determined fact that “Smith” has length 5. This 
gives us our second term on the right side of (1), i.e., the 
Bayes Factor for the “length event.” The product of the 
Bayes Factor and the a priori probability is the a 
posteriori probability that “Smith” is a zone-number. 
Not very surprisingly, this is a small number. We now 
compare this number with two thresholds. If the a 
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P(H/Ei) = P(H) 


P(Ei/H) 


= P(H) 


P(Ei/H) 


P(Ei) 

= (0.017) (0.0001) =0.0000017 


P(£!/F)P(ff) + P(Ei/H)P(H) 

> 0.000001 = Rejection Threshold 
< 0.9 = Acceptance Threshold 


P(H/E 1 -E 2 ) = P(H) 


P{E l /E)P(E 2 /H-E l ) 


PiM) 


P(Ei/H) P(E 2 /H) 


P(E v E 2 ) * ' 7 P(Er) P(E 2 ) 

= (0.0000017) (0.00001) < 0.000001 = Rejection Threshold 


where 


H = hypothesis that “Smith” is a “zone-number” 

Ei = the event that the length of “Smith” is “5” 
x E 2 = the event that the pattern of “Smith” is “all alphabetic” 


( 1 ) 


(2) 


Fig. 2 — Hypothesis testing. 


posteriori probability exceeds the acceptance threshold, 
we accept the hypothesized identification and turn our 
attention to the next actual word; if the probability 
falls below the rejection threshold, we reject the hy- 
pothesis, and test the actual word against the next ex- 
pected word class. Finally, if our probability falls be- 
tween the two thresholds, we test the same hypothesis 
against the next event, using our a posteriori probability 
as the new a priori probability. In our example in Fig. 
2 we have been generously low with our rejection 
threshold, so that it is necessary to go to (2) where we 
test the hypothesis, “Smith == zone-number,” against the 
character pattern, and allow the low probability of a 
zone-number consisting exclusively of letters, to push 
our hypothesis into limbo. If we were scanning French 
addresses, with zone-numbers given in Roman numerals, 
the Bayes Factor in (2) would be very different. 

After scrutinizing all the actual words on the envelope 
in this manner, we may find that certain words are still 
unidentified. In this case, we iterate through our process 
once again. However, certain features of the process will 
have changed. Suppose that we have identified two 
different zone-numbers in the first pass. Since we expect 
to find no further zone-numbers, we no longer test any 
of our undecided actual words against the hypothesis 
that they are zone-numbers. This not only reduces our 
processing time — it also changes the a priori probabil- 
ities of the remaining word classes, and affects the num- 
bers entering into all the Bayes Factors. Another change 
in the second pass is that new evidence can be used to 
give rise to Bayes Factors. A word identified as a zone- 
number in the first pass provides strong evidence that 
the word to its left is a city name. Clearly, the topolog- 
ical relationships subsisting between words cannot be 
utilized until some words have been identified. 

If successive identification passes still leave a resid- 
uum of unidentified actual words, as might happen if, 
for example, two or more words were run together, thus 
appearing to the machine as one word, there are sub- 
sidiary tricks that can be played. Due to time limita- 
tions, I shall have to leave these tricks to your imagina- 
tion, and move on to the recognition phase. 


In the simplest case, all actual words will have been 
correctly identified and, if the words are all correctly 
spelled and correctly ingested by our character-reader, 
recognition will consist of little more than finding the 
exact match in the proper list, a list determined by the 
identification of the word. It is possible to make even 
this simple process simpler or, at least, faster. To search 
a list of all the cities in the United States can be time- 
consuming, particularly if the list must be transferred 
from tape to core memory. However, if the correspond- 
ing state has previously been recognized, then a much 
reduced list of cities can be inputted and searched. Sup- 
pose further that the corresponding zone-number has 
been recognized as “25.” Then we need consider only 
those cities in the given state which have at least 25 
zones. 

If we are bound to get a direct match whether we 
scan a big list or a little list, this process of list reduction 
is of secondary value only. It is when a direct match is 
not forthcoming that this technique assumes greater 
importance. In the absence of a direct match, we are 
constrained to use brute force techniques of a more or 
less sophisticated nature. If we are fortunate enough to 
reduce a list down to one entry, then we can avoid brute 
force completely. Failing this, we can expect two ad- 
vantages to accrue to the use of a reduced list, in general. 
First, for the same elapsed time, we can employ more 
brute force techniques per list entry; second, we can at 
least hope that, by reducing our initial list, we will ex- 
punge spurious candidates to which our brute force 
techniques might give scores equal to, or even greater 
than, the score of the correct candidate. For example, 
Fig. 3 gives one horrible example, often quoted in this 
connection. Recognizing (a) as being either “New 
York” or “Newark” is an awful job. A non-brute-force 
technique, such as list reduction, which removes the 
false entry from consideration is a welcome way of 
cutting this Gordian knot. 

Again for lack of time, I must give the actual brute- 
force techniques a very hasty treatment. Let me men- 
tion just two techniques of the many available. To 
match a word which has had two letters transposed [as 
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in (b) in Fig. 3], against the original word, we look for 
list entries with the same letter composition as our ac- 
tual word, i.e., entries with the same number of A’s, 
B’s, C’s, etc. Scanning these for a single transposition is 
relatively easy. 

A second technique is useful when a letter or two (or 
more) has been erroneously dropped from, or added to, 
a word, (c) in Fig. 3 is due to a stuttering typist who re- 
peated the first letter of the word. Two words run to- 
gether provide further examples of this kind of noise. 
What we try here is a direct match of our actual word 
with a proper subset of our list entries, and vice versa. 


(a) Newyark 

(b) Pheonix 

(c) Bboston 


Fig. 3 — Typical typographical errors. 

If no amount of brute force seems to work, and cer- 
tain words just cannot be recognized, we can either give 
up gracefully at this juncture or we can admit, even 
more gracefully, that one of our educated gusses might 
have been wrong. If we choose the latter alternative, we 
have the messy job of deciding whether we went hay- 
wire in the recognition phase, or all the way back in the 
identification phase. In either case, it is still necessary 
to find a likely spot for picking up the dropped stitch 
without causing the entire garment to unravel. Some- 
times, indeed, we are left with the original ball of wool. 
These, however, are almost always the cases which 
stump human editors. 

This ability to iterate back, and back, and back, can 
of course lead to excessive use of computer time. It does 
have its advantages though. It means that a bad guess 
is not an irrevocable misstep. It also means that various 
parameters, the identification acceptance and rejection 
thresholds, for example, are not nearly as critical as they 
would be in a once-through process. Since these are 
among the hardest parameters to estimate accurately, 
any diminution of their sensitivity is a positive gain. 

At this point, I should like to restate our major tech- 
niques in somewhat folksier terms than “Bayes Fac- 
tors” and “list reduction.” In our identification phase, 
we attempt to use the constraints imposed by the for- 
mat, mailing envelopes in our example, plus the con- 
straints of the language itself, the length and character 
patterns of the expected word classes, to provide a rudi- 
mentary form of pattern recognition. We then use our a 


priori knowledge of word statistics and interword rela- 
tionships to find the most probable matching of the 
actual words to the expected word classes. Human be- 
ings presumably use rank orderings of hypotheses, 
modified by intuition, to perform such matchings. Since 
machines lack intuition and since we have not yet devel- 
oped a calculus of rank orderings, we use the para- 
phernalia of Bayes Factors to accomplish the same task. 

Without undue stretching of the terms, we might say 
that, in the identification phase, we exploit the syntactic 
constraints on the language we are processing, whereas 
in the recognition phase, by our use of the list reduction 
technique, we exploit the semantic constraints. We 
might subsume both types of constraint under that 
much-abused word, redundancy. As for a folksy term for 
our brute-force techniques, the most accurate that occurs 
to me is “knowledgeable cynicism.” We expect errors to 
be made and, usually, we have some information as to 
the kinds and sources of error, as well as their frequencies 
of occurrence. If we know that typists frequently hit a 
key next to the one they should hit, we store the key- 
board pattern in our program; if we know that our 
character-reader frequently confuses “o” with “c,” that, 
too, goes into our dossier. 

A final word now as to the applicability and practi- 
cability of our techniques. What with tape searches and 
Bayes Factor computations, processing time may, but 
need not always, be excessive. The preparation of all the 
lists required in the recognition phase is a painful task. 
With a relatively stagnant language, this list-making 
can be a one-shot ordeal; with a volatile language re- 
quiring frequent updating of the lists, the pain might be 
unbearable. What has been said about lists also applies 
to the preparation of the probability tables for the 
identification phase. A final pause-giving consideration 
is the amount of redundancy in the language to be 
processed, particularly when the processor cannot es- 
tablish the language, which he sometimes can do. Our 
private feelings are that a language sufficiently low in re- 
dundancy to be unintelligible to a machine will also be 
unintelligible to a man. I won’t press a point which 
trods so heavily on anthropocentric toes. 

In summary, then, we feel our techniques can be use- 
ful in some massive data-processing applications, in 
automating post offices, in translating natural languages, 
where every second word in the source language has 
several correlates in the target language, and we know 
our techniques have worked at least once. I won’t ask 
that you take this on faith, though I’d appreciate it if 
you would. 




74 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


A Memory of 314 Million Bits Capacity with Fast 
and Direct Access — Its Systems and 
Economic Considerations 

N. BISHOPj and A. I. DUMEYJ 


T HE graphic arts laboratory of Time Inc. in 
Springdale, Conn., is a laboratory largely devoted 
to improving the arts of printing and paper- 
making. Of equal importance to a publisher of weekly 
news magazines is a large and well-serviced list of sub- 
scribers. This may serve to explain why there exists to- 
day in Springdale a completed and working engineering 
prototype of a large direct-access memory. This equip- 
ment was developed by the laboratories to prove the 
technical and economic feasibility of storing and proc- 
essing large basic record files in memories providing 
direct access facilities to individual records. Since our 
program of systems study, equipment development, and 
economics study has indicated such feasibility, a review 
of our findings may well open new avenues of approach 
to the solution of present and expected future problems 
in the expanding field of automatic data processing. 

The data-processing systems designer is constantly 
striving to strike an optimum balance between the costs 
of data storage, computer time, job requirements of in- 
put and output, and manpower. So fast is the art pro- 
gressing that today’s best balance for customer A may 
be quite different from that for customer B a year later, 
even though A and B present identical job requirements. 

The concept of large data-storage capacity coupled 
with direct access to individual records has long in- 
trigued the systems man. His interest is usually dimin- 
ished when he starts to consider the cost of storing large 
basic files in such a manner with the presently available 
memories providing such direct-access facilities. Either 
the storage cost per bit is too high or the access time is 
too high to keep up with the daily work load. Serial 
storage provides high capacity and low cost per bit, but 
it does not provide for the often essential requirement of 
direct access to individual records. There appears to be 
a definite gap in the systems man’s bag of tricks and 
hardware which needs filling by a direct-access memory 
of high capacity, reasonable access time, and reasonable 
cost per bit for storage. 

In designing a high-capacity direct-access memory 
for use in very large basic record files, it is necessary to 
make as many records as possible accessible to each 
processing station, if low cost per bit of storage is to 
result. At the same time, access times must be low 
enough to handle traffic requirements. Excessive pack- 
ing factors result in intolerable tolerance requirements 

t Time Inc., Springdale Labs. Div., Springdale, Conn, 
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for manufacture and maintenance. Excessive travel dis- 
tances from record to processing station result either in 
immoderate acceleration and torque requirements or 
excessive access times, if acceleration and torque re- 
quirements are kept within bounds. Too many records 
per access mechanism, with consequent increase in 
traffic per unit, can overburden the designer with access- 
speed requirements. The key to an approach which will 
meet systems and economic requirements is expressed 
in one word — moderation. A combination of moderate 
packing factors, moderate travel distances, moderate 
accelerations, and moderate average access times can 
result, and has resulted, in a fast and direct access 
memory with a record capacity sufficient for economical 
large basic file application. Let us now discuss its opera- 
tional characteristics. 

The significant characteristics of a direct access 
memory to a systems man are those which answer the 
questions: “How much data can I store?” “How many 
entries can I process per working day?” “What is the 
best available estimate of the cost per bit of storage 
plus provision for direct access?” 

As far as the bit capacity of presently available direct- 
access equipment is concerned, the upper bound seems 
to be a few tens of millions; and, on the cost side, the 
lower bound seems to be some ten bits for the penny. 
Access times in such memories vary from a second down 
to a few milliseconds, the latter figure reflecting the 
need to cover high activity against the file, as in the 
case of short-term airline reservations. 

The engineering prototype of our memory has a total 
information bit capacity of 314,500,000. This capacity 
is distributed in 262,144, (2 18 ), separate storage loca- 
tions within the memory, each location providing a 
message capacity of 1200 bits. The operation of the 
memory is under the control of three basic commands 
derived from peripheral equipment. The first command 
consists of an 18-bit parallel-fed address, followed by a 
seek record signal. The second is a process command 
calling for a read, write, or erase cycle. The third is a 
release-record command, which restores the memory to 
readiness for operation on the next series of commands. 

The maximum time required to execute a seek record 
command is 3.87 seconds and the minimum is 0.61 
second. Each process cycle, read, write, or erase, takes 
0.45 second. On completion of the last process cycle, 0.6 
second is required to restore the memory to a condi- 
tion of readiness for the next series of commands. 

These are the basic figures. Here are some examples 
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of how they would be applied to practical situations. 

In an information-retrieval system, in which addi- 
tions to the file tend to occur rarely, the over-all average 
working cycle is 3.14 seconds, which includes 0.45 
second for one read cycle. Maximum and minimum 
figures would be 4.92 and 1.66 seconds, respectively. 

In our own operation, about 95 per cent of the actions 
on the file call for five processing cycles per access. This 
results in an over-all working cycle of 6.72 seconds maxi- 
mum, 4.94 seconds average, and 3.46 seconds minimum. 
These last figures imply that, under an even traffic flow, 
over 2 per cent of the storage locations can be consulted 
and processed in an eight-hour day. 

Obviously, file arrangement, input form, and other 
considerations are important factors in arriving at a 
working average of access and processing cycle for any 
direct-access memory. The maximum, minimum, and 
average figures will permit a rule of thumb estimate of 
the applicability of this equipment to various problems. 

In our own case, there were serious queueing considera- 
tions and addressing limitations, which made the actual 
speed of the device available as a safety factor for an 
actual file activity of about 1 per cent of the file per day. 

Neither time nor circumstances permit a complete 
description of our engineering prototype at this time; 
however, some of its physical characteristics may be of 
general interest. Information bit rate in and out is 
20,000 serial bits per second. Information input and 
output levels are adequate for computer communica- 
tion purposes. Signal-to-noise ratio, measured as the 
ratio of peak signal voltage read from a recorded loca- 
tion to the peak noise voltage read from an erased loca- 
tion, exceeds 40 db. Semiconductor components are 
used throughout. The circuits within the memory 
cabinet provide all systems communication and control 
facilities except buffering. Power requirements are 550 
watts average and 750 watts peak load on a 115-volt 
60-cycle single-phase circuit. The prototype, which 
weighs 4800 pounds including its glass and steel en- 
closure, occupies a floor space of 51 inches by 62 inches 
and is 95| inches high. It is designed to operate in 
average office conditions of temperature, humidity, and 
dust content. 

This equipment is now set up for demonstration and 
cycled testing in the laboratory. It operates under the 
control of a device which was designed to run it in a test 
mode. The test equipment consists of a Flexowriter, 
1200-bit shift register buffer, comparator and control 
circuits which also provide coding interlocks. Keyboard 
control, tape reader control, or a combination of the 
two is possible; and information may be stored, edited, 
justified, transmitted, and received. Special test tapes 
have been prepared which permit repetitive cycles of 
access and processing at rates, and in a manner, simu- 
lating hard usage in a system. The comparator and 
counter circuits associated therewith permit a wide 
variety of error checks. Input and output can be repre- 
sented on hard copy, punched tape, or a combination of 
the two. Many test runs have clearly indicated the few 


desirable design modifications for inclusion in a produc- 
tion unit. 

In order to complete our economic studies on the ap- 
plication of a direct access approach to the handling of 
our subscription records, we had an independent manu- 
facturer prepare a manufacturing cost estimate on the 
basis of duplicating the prototype memory in quantities 
necessary for such application. This estimate indicated 
a cost of storage of 2 cents per bit or less. Attractive as 
this figure of cost per bit for direct-access storage may 
be, the cost of storing billions of bits of data, such as a 
large publisher’s subscriber records, represents a con- 
siderable investment for storage alone. However, the 
compensating factors are considerable. First, a poten- 
tially better grade of subscriber service is possible, based 
on faster action on subscriber requests; and second, a 
considerable saving can be made in costly computer 
decision-making time, as a result of limiting such com- 
puter time requirements to entries demanding decision. 

The original specifications for our memory were based 
on the concept of storing all file data on each subscriber 
in one addressable location of 1200-bit capacity. As our 
studies progressed, it became apparent that it is not nec- 
essary to store all file data under direct access, since 
many of the data are not required for daily file main- 
tenance, and can be stored to better systems operational 
advantage in serial access memories. Since we planned 
to store the name and address of subscribers on serial 
tape for operating address label printers, and the name 
and address always appear as input with each action 
against the file, it is unnecessary to store these data under 
direct access. Complete billing information can also be 
stored on tape, as it is not required for the daily main- 
tenance job. It was decided to limit the storage per sub- 
scriber under direct access to the minimum required for 
decision-making in daily file maintenance plus the few 
characters necessary for minimizing duplication. The 
data retained are sufficient to provide systems communi- 
cation with related files. Thus, it is now possible to 
store the decision-making data on two subscribers in 
one 1200-bit storage location. While traffic and overflow 
requirements have not permitted a 50 per cent reduc- 
tion in the number of direct access units required in our 
system, they have enabled a 30 per cent reduction with- 
out impairing systems performance. 

Efficient usage of direct-access storage capacity re- 
quires particular attention to the matter of distribution 
of entries within the file. This is especially true when 
input to the file is uncontrolled, as in a subscription serv- 
ice operation. An answer to efficient distribution within 
the file was found in an addressing procedure which 
translates a subscriber’s name and address into a ran- 
dom and reproducible address within the file. Overflow 
capacity within each memory is reserved to handle new 
entries addressed to fully occupied storage locations. 

An action against the file usually begins with a letter 
including the subscriber’s name, address, and requested 
action. The file location for a subscriber, whether a new 
or an old customer, is derived from the most reliable 
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portions of the name, street address, city, and state. As 
stated above, to attain efficient usage of memory ca- 
pacity, the data supplied by subscribers must be manip- 
ulated by a computer into addresses which distribute 
randomly over the field of available storage locations; 
and furthermore, identical subscriber data must always 
produce the same address. We have chosen a computing 
technique which, in brief, considers the subscriber-sup- 
plied data as a binary number, divides it by a prime 
number almost as large as the number of addressable 
locations, and uses the remainder as the address. 1-3 
Such randomizing techniques lead to occasional dupli- 
cation, and the system must be capable of distinguishing 
between such cases and of providing accessible storage 
for each. Entry data comparison with data stored at an 
occupied address in the memory provides this distinc- 
tion. If the comparison indicates that the entry is in fact 
a new one, and if the location is fully occupied, the new 
entry is referred to a directory of unallocated storage 
locations specifically reserved for such overflow cases. 
The first such location address readout directs the 
access to this location where the new entry data are 
stored. This address is, at the same time, erased from 
the overflow directory and recorded in the original stor- 
age location. Subsequent access to the new entry is at- 
tained without reference to the directory, as the over- 
flow address is now stored in the addressable location 
computed from subscriber-supplied data. The directory 
of available unallocated storage locations is kept cur- 
rent by re-entering storage locations allocated to over- 
flow when they are emptied by subsequent file action. 
This approach to the attainment of economical distribu- 
tion of entries in a direct-access file system aopears to 
result in a favorable balance between the total number 
of addressable locations required and the multiple ac- 
cesses required to reach entries stored in locations allo- 
cated to overflow. Other equally favorable solutions 
may result from specific choice of final systems com- 
ponentry. 

The principal output of a systems design for sub- 
scriber service is in the form of address labels printed 
on a weekly schedule. Conventional tape storage is the 
logical way to store the mailing list for efficient control 
of output printers. This requires weekly preparation of 
a new mailing tape, which is produced by collating a 
weekly change tape with the previous week’s mailing 
tape. Computer requirements for this relatively simple 
collating operation are moderate, and our studies indi- 
cate that low-cost computers are adequate for this opera- 
tion. A tape converter is included to translate the out- 
put of the file maintenance portion of the system into 
the tape format used by the collator. Decision-making 
computer requirements for file maintenance have been 

1 L. N. Korolev, “Coding and code compression,” J. Assoc. Comp. 
Mach., vol. 5, pp. 328-330; October, 1958. 

2 W. W. Peterson, “Addressing for random access storage,” IBM 
J. Res. Dev., vol. 1, pp. 130-146; April, 1957. 

3 A. I. Dumey, “Indexing for rapid random access memory,” 
Computers and Automation, vol. 5, pp. 6-9; December, 1956. 


minimized, since the daily work load is limited by direct 
access storage to an estimated average of 100,000 items 
per day against a total file of 12,000,000 entries, some 
nine to ten million of which are active. 

We proposed to use a certain medium size computer 
for updating, because, although the list size is 12,000,000, 
the daily work load of this portion of the system is in 
the range of 100,000 items. In other words, instead of 
having to cope with about 400 items per second in order 
to run through the file in one day, the file maintenance 
equipment has to handle about four. In recapitulation, 
we can say that the more complex items, totalling about 
100,000, make a day’s work for a medium-size com- 
puter, while the 9,000,000 items of mailing information 
(only | of the file is active) are handled by simpler tape 
collators. 

Communication equipment was required to join the 
memories with the main computing element. Our plans 
called for a combination of a limited number of buffers, 
and a cross-bar switching arrangement, the speeds of 
each being consistent with the expected ratio of work- 
ing cycles to access or rest times. Other details, too 
bound up with the specific use of the equipment to be 
of interest, will not be discussed. We did consider, as 
every user of new equipment must, the problems of 
conversion, including build-up of the file and parallel 
operation of the new and old system. 

Our systems studies indicated that conventional tech- 
niques and readily available equipment could handle 
the systems functions of input, communication between 
computers and associated memories, control and 
switching, and output printing. We shall not dwell on 
these systems aspects other than to say that their com- 
bination with the processes described above resulted in 
a positive answer to the question, “Can direct access 
memories be applied with good economy to the daily 
maintenance of very large subscriber files?” 

Our subscription service is a large-inventory, high- 
volume, data-handling problem with many complexities 
due to the nature of the weekly news magazine business. 
The amount of equipment necessary to the solution of 
this problem is determined by the job requirements and 
not by the operational characteristics of the direct-ac- 
cess memory we have described. There are other appli- 
cations where the combination of such a memory with a 
small-scale computer will perform all necessary systems 
functions. Newly developed and working equipment 
opens consideration of new fields for electronic data 
processing, and it is our belief that this development will 
provoke such consideration. 

We have shown that very favorable costs per bit can 
be achieved if traffic requirements are modest. Systems 
considerations have been noted. Testing experience has 
been given. Addressing and retrieval ideas have been 
set forth. We believe that a workable, large-scale, direct- 
access memory, with an access time of a few seconds, 
has a place in the roster of useful data-processing equip- 
ments, and we invite your attention to its possibilities. 
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Information Retrieval on a High-Speed Computer 

A. R. BARTONf, V. L. SCHATZf, and L. N. CAPLANf 


T HIS discussion concerns a mechanized informa- 
tion retrieval system for the technical library of 
General Electric’s Aircraft Gas Turbine Division. 
The system is confined to the technical reports and 
papers available in the Division’s Library. Textbooks 
have not been included as of yet since they are carried on 
a Library of Congress system and did not readily fit 
into the manual scheme which was developed. This dis- 
cussion will be in two parts. The first part will cover the 
information retrieval system prior to mechanization, 
and the second part will cover the mechanization of this 
system. 

The technical library was established in 1953 using a 
Uniterm or key word coordinate indexing system. To 
understand this system, we will follow the progress of a 
publication through the various steps of the system. 
First, as a publication is entered into the library, it is 
assigned a six-digit access number (see Fig. 1). This re- 
port is typical of the technical reports generated within 
the Division. Another example of reports in the library 
would be NACA technical reports. The next step in this 
system is the abstracting of the document. The ab- 
stracting is done by professional librarians and then 
posted to a card file as shown in Fig. 2. This card is 
controlled according to the previously assigned access 
number. The next step in the system consists of review- 
ing the title, abstract, and document to select the most 
descriptive words which will identify the document. 
These words are primarily nouns. They become the uni- 
terms. In our hypothetical case, these words are shown 
on the right in Fig. 2, just as they appear in the system. 

These uniterms, along with the appropriate access 
number, are then posted to the Uniterm file (see Fig. 3). 
There are 100 numbers per side of card when full. Both 
sides are used, and in the case of general terms such as 
these, they are heavily posted so that several cards are 
required. Certainly this system appears cumbersome at 
this point, but it has the advantage that, in any given 
technical area, the number of uniterms tends to level 
off at a specific number after the system is developed. 
In our case, this number is something under 9000. The 
combined system is shown schematically in Fig. 4. I 
think now you can see how information is recalled from 
this system. The requestor discusses his problem with 
the librarian. They decide on the uniterms to search. 
The librarian then furnishes the appropriate Uniterm 
cards to the requestor. Once again referring to Fig. 3, 
the problem facing him can be seen. He must cross co- 
ordinate the cards to find numbers which apply to all 
uniterms. In our case, we have used three uniterms. A 

f General Electric Co., Cincinnati, Ohio. 


more typical case would be four to six uniterms but 
with less access numbers per term. In any case, if the 
requestor is persistent, he will come up with some of the 
matching numbers. The librarian can then go to the ab- 
stract file for the abstract cards. These are perused by 
the requestor who in turn selects from the abstracts 
those documents he desires to read in full. In the case 
of the three uniterms we have selected, each has over 
1000 access numbers posted to it. It is easy to see the 
difficulty of cross coordinating these cards. This dif- 
ficulty did little to promote the use of the technical li- 
brary. The result was duplication of experiments, tech- 
nical studies, etc., with the attendant delays in time and 
increases in cost. 

Obviously, something better was needed. That “some- 
thing better,” we feel, was the program written for our 
IBM 704. This program is basically a mechanization of 
the manual system with very little effort to change the 
system itself. This program is in two parts. Part one is 
file updating and the cross coordinating of the master 
Uniterm tape, or, referring to the manual system, the 
Uniterm card file. Part two concerns the selection and 
printing of abstracts. (Part one can be run independ- 
ently of part two if desired.) 

Cross coordinating, at present, is done on a strict 
AND system. That is, an access number must appear 
under each Uniterm used in the search. The possibility 
of an AND/OR system was considered and rejected. 
However, the AND/OR approach is being used on a 
modification of this program for a different type of in- 
formation retrieval system. If no information is found, 
a modified list of uniterms can be developed by the re- 
questor and the librarian, and another run made. 

It was decided during the development of the pro- 
gram that one of the features of part one was to update 
new information into the file at the time of cross co- 
ordinating. This decision was based on the knowledge 
that in any information retrieval system there is a major 
problem of keeping the system updated. Thus, all uni- 
terms with their associated access numbers were stored 
on the master Uniterm tape in alphabetic order. 

All search information and updating information can 
be read into the machine from either the card reader or 
tape. This information is processed to determine if it is 
in alphabetic order and if there is any updating infor- 
mation. If the input is not in alphabetic order, it will be 
sorted within the machine. If there is no updating, a 
new Uniterm master tape will not be made up. Both of 
these decisions are made within the machine by inter- 
preting the input information. 

To utilize best the 32,000 locations of memory in the 
704, it was decided to use long records of information. 
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Fig. 2 — Abstract card with uniterms. 


Many of the uniterms, with only a few access numbers 
posted to them, are combined into one record with a 
total length of not over 7000 words. A total of 9000 lo- 
cations is reserved for reading in these records. If hny 
record exceeds 7000 words, the program will try to 
separate this record into two records. If the record ex- 
ceeds 8995 words and cannot be broken down, the pro- 
gram will be halted. Because of this factor, no Uniterm 
can have over 8995 access numbers posted to it. For a 
larger system, this could be easily modified so that there 
would be no limit to the number of access numbers 
posted to a Uniterm. Another feature to help cut down 
on the length of records is whenever a new Uniterm is 
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Fig. 3 — Uniterm wheel cards. 



Fig. 4 — Flow chart of manual system. 


added, this Uniterm will start a new record. This will 
also tend to break one record into two records because 
this Uniterm may fall alphabetically between two uni- 
terms that are in one record (see Fig. 5). 

As the machine does the cross coordinating, each 
search is stored in variable length buckets in memory. 
The total length of these buckets is also 9000 words. 
If the amount of information stored in these buckets ex- 
ceeds 9000 words, the Uniterm that caused the overflow 
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Fig. 5 — Updating Uniterm master tape. 

is stored in a temporary area and the search continues. 
As more uniterms are read and cross coordinated, the 
length of the buckets is decreased, permitting the addi- 
tion of another search at the end of the last bucket. 
When the master tape has been completely read, the 
program rewinds the tapes and makes a second pass on 
the tape using the uniterms from the temporary area. 
At this point, let me stress that a second pass is only 
made if there is an overflow on the first pass (usually 
over 50 searches). In most instances, this will not occur. 
If on the second pass all uniterms cannot be processed, 
the program will notify the operators that the search 
is too large and must be made smaller. 

The present system contains about 35,000 abstracts 
with an average of ten uniterms each, or a total of over 

350.000 numbers posted to the master Uniterm tape. 
Part one will handle 99 searches at once with no limit 

on the number of uniterms per search. It will also handle 
unlimited updating. A normal run is about thirty 
searches and updating of about 2000 access numbers 
into the general file. The time required is about two 
minutes for searching and four minutes for updating, or 
six minutes total. Tape assignments for part one are 
shown in Fig. 6. 

Part two is the printing out of all abstracts which cor- 
respond to the access numbers discovered in the first 
part. One million abstracts have been allowed for in the 
program. Timing for part two is approximately four 
minutes per 10,000 abstracts. At the present time, 

10.000 abstracts are placed on a master tape in numeri- 
cal order by access number. A statistical study is now 
being conducted so that abstracts will be in order by 
frequency of use. This will significantly improve the 
timing for part two as we expand our system. Tape as- 
signments for part two are shown in Fig. 7. 

Originally it was thought that the program would, in 




most cases, go right on into part two. Therefore, re- 
ferring to Fig. 6, it can be seen that the only tapes avail- 
able at this time are tapes 7 and 8. The first abstract 
master to be read is found on unit 7, the next on unit 8. 
While the program is searching tapes 7 and 8, the 
operator can mount other abstract master tapes on 
units 4, 5, and 6. After each master tape is searched, 
another abstract tape can be mounted. A continuous 
loop is set up selecting tapes 7, 8, 4, 5, 6 — - 7, 8, 4, etc. 
The number of tapes to be read is determined by an in- 
put card. All access numbers found in part one are 
sorted into numerical order before starting part two. 
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At the beginning of each master tape is an indication 
of the range of access numbers of that tape. Each num- 
ber that the program is looking for is compared against 
this range, and if it is not on this tape, an on-line com- 
ment will tell the operator we are finished with this tape. 
The program will then modify all addresses to pick up 
the next tape unit in sequence. 

The output of this system is in two parts as shown in 
Fig. 8. At the top is a sample listing of those access 
numbers located by the system. Below is a set of single 
sheets of printing. Each sheet contains the abstract cor- 
responding to one access number as shown in the column 
on the left. Each sheet is pre-addressed for direct mail- 
ing. If there is a security classification, it is shown. If for 
any reason it is considered necessary to suppress print- 
ing of security or proprietary information, this suppres- 
sion is under the control of a sense switch. It is now 
possible for the requestor to review the abstracts, select 
those for which he would like to receive the original 
document, check the appropriate access number on the 
list on the left, and return this single sheet to the 
library. 

I think that many of the advantages of this system are 
obvious, among which are speed, cost, designation of 
security classification, and the direct mailing feature. 
Advantages that are not obvious are: 

1) Reduction in amount of human handling with the 
resultant errors. 

2) All information is in narrative and is alphabetic. 
There is no coding. 

3) Complete abstracts are readily available to the 
requestor. 

4) The need for extensive manual files is eliminated. 

5) No information ever need be removed from the 
system so no information can be lost. 

This system has been in operation in General Electric 
since September, 1958. Some of the modifications that 
we have found to be advantageous through experience 
are: 

1) In most cases it is desirable to stop the program 
after part one and examine the print-out before 
going on to part two. 

2) It is advantageous to be able to supply to part two 
the ranges of access numbers wanted on each 
search. 

3) An upper limit should be placed on the amount of 
abstracts to be printed if it is requested that the 
program continue on into part two without stop- 
ping after part one. 

4) A method of combining several uniterms into one 
composite term on any individual search is ex- 
tremely necessary. 



5) If any Uniterm reduces the number of access num- 
bers found to zero, this term is eliminated and the 
search continues as if this word was not given in 
this search. 

These are only a few of the conclusions we have 
reached. Future experience, we are sure, will dictate 
many other additions or deletions to this program. Of 
course, we are looking forward to the time when with 
new equipment we will be able to search tapes simul- 
taneously, thereby reducing our running time by a fac- 
tor of approximately 10 to 1. 

We are presently planning to expand this system to 
mechanize the records involved in checking material 
into and out of the library. We also plan to develop an 
automatic overdue notice system. 

This information retrieval program has enjoyed wide 
acceptance in our plant. We have received requests to 
modify the program to process various types of person- 
nel registers, engine test data files, specialized blueprint 
files, and various other types of information systems. 
In any place where the key word coordinate indexing 
system or some variation of it can be used, this program 
seems to be the answer to many of our problems. 
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The Next Twenty Years in Information Retrieval: 
Some Goals and Predictions* 

CALVIN N. MOOERSf 


A Historical Perspective 

ALTHOUGH information retrieval has lately be- 
come quite a fad, I intend in this paper to stand 
^ ^ back and take an unhurried look at what is going 
on, and try to predict where this field must go and what 
it must do in the future. “Information retrieval” is a 
name that I had the pleasure of coining only eight years 
ago, at another computer conference. 1 The name has 
come a long way since then. 

In thinking about a definition of information retrieval 
and in considering the future of this field, we must take 
an evolving view. At the present time, information re- 
trieval is concerned with more than the mere finding and 
providing of documents; we are already concerned with 
the discovery and provision of information quite apart 
from its documentary form. To encompass future de- 
velopments, as we shall see, even this broad view of in- 
formation retrieval will have to be modified and ex- 
tended. 

When we speak of information retrieval, we are really 
thinking about the use of machines in information re- 
trieval. The purpose of using machines here, as in other 
valid applications, is to give the machines some of the 
tasks connected with recorded information that are 
most burdensome and unsuited to performance by hu- 
man beings. At all times, it is important to remember 
that it is the human customer who uses the informa- 
tion-retrieval system who must be served, and not the 
machine. It makes a difference who is served, and this 
little matter is sometimes forgotten in computer proj- 
ects. 

To get a historical perspective of the introduction of 
machine methods to information retrieval, let us look 
back over a bit of history. I think that it can be said that 
the introduction of machine methods has followed the 
realization of a need, backed by pressure and means to 
do something about the need. Thus, although quite 
powerful mechanical methods could have been de- 

* This work has been supported in part by the U. S. Air Force 
Office of Sci. Res. through Contract No. AF 49(638)-376. All opinions 
are those of the author. 

f Zator Co., Cambridge, Mass. 

1 C. N. Mooers, “The Theory of Digital Handling of Non- 
Numerical Information and its Implications to Machine Economics,” 
Zator Co., Cambridge, Mass., Tech. Bull. No. 48, 1950; paper 
presented at the March, 1950 meeting of the Association for Com- 
puting Machinery at Rutgers University, New Brunswick, N. J. 

C. N. Mooers, “Information retrieval viewed as temporal signal- 
ling,” Proc. Internatl. Congr. of Mathematicians, Harvard University, 
Cambridge, Mass., vol. 1, p. 572; August 30-September 6, 1950. 


veloped by the technology of the Hellenistic Era for the 
Library of Alexandria, other methods of retrieval, pre- 
sumably based upon human memory, and the making of 
lists, were apparently considered quite satisfactory. The 
simple, though powerful, mechanical technique that 
could have been used at Alexandria is the method of per- 
forated stencils invented by Taylor in 1915, which has 
sometimes more recently been called “peek-a-boo.” 2 
The British inventor Soper in 1920 patented a device 
which was an improvement upon Taylor’s perforated 
stencils, and Soper described the use of his mechanism 
for information retrieval employing some truly ad- 
vanced conceptions. 3 

Much more attention, however, has been given to the 
development of devices for scanning and selecting upon 
film strips. This work was apparently spurred by the 
perfection of motion picture films and cameras. Gold- 
berg in 1931 patented one of the earliest film-scanning 
and photographic-copying devices. 4 Independently, 
Davis and Draeger during 1935, in the early days of the 
American Documentation Institute, in connection with 
their pioneering work in microfilm documentation, in- 
vestigated the feasibility of a microfilm scanner using a 
decimal coding. 5 Apparently stimulated by reports of 
this work, V. Bush and his students at M.I.T. in 1938- 
1939 built perhaps the first prototype machine along 
these lines, a microfilm scanner with each frame of text 
delineated by a single decimal code for the subject, and 
a photo flash copying method. However, they were un- 
able to interest any commercial or governmental or- 
ganization in the device, and wartime distractions inter- 
vened soon thereafter, so the project was dropped. Two 
more “rapid selectors” based upon these same general 
principles have been built, 6 - 7 but for various reasons 
neither of them has operated in a fashion that is consid- 

2 H. Taylor, “Selective Device,” U. S. Patent No. 1,165,465; 
December 28, 1915 (filed September 14, 1915). 

3 H. E. Soper, “Means for Compiling Tabular and Statistical 
Data,” U. S. Patent No. 1,351,692; August 31, 1920 (filed July 23, 
1918). 

4 E. Goldberg, “Statistical Machine,” U. S. Patent No. 1,838,389; 
December 29, 1931 (filed April 5, 1928). 

6 R. H. Draeger, “A Proposed Photoelectric Selecting Mechanism 
for the Sorting of Bibliographic Abstract _ Entries from 35 mm 
Film,” Documentation Inst, of Science Service, Washington, D. C. 
(now American Documentation Inst.), Document No. 62; July 27, 
1935. 

6 R. Shaw, “Rapid selector,” J. Documentation, vol. 5, pp. 164- 
171; December, 1949. 

7 Anonymous, “Current Research and Development in Scientific 
Documentation,” National Science Foundation, Washington, D. C., 
Rep. No. 3, p. 27; 1958. 
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ered generally acceptable, and neither is currently in 
actual use. At the present time, much attention is fo- 
cused upon the Eastman Minicard machine, a cross be- 
tween a rapid selector and a Hollerith punched-card ma- 
chine. 8 

Card-sorting devices, such as those based upon the 
Hollerith card (the card used by IBM), as well as those 
based on other cards such as Perkins’ marginally 
punched card, were recognized at an early date to be far 
too slow to cope with problems such as those of Chemi- 
cal Abstracts. Within the past few years, there have been 
a number of instances of the use of electronic computing 
machines to perform information retrieval. As comput- 
ing machines are presently designed, they are not 
matched to the job of information retrieval — they can 
do it, though not efficiently — and the situation of using 
a computing machine for this job is like using a bulldozer 
to crack peanuts. Oftentimes, if the information collec- 
tion is small enough to allow the problem to fit upon the 
computer, there are easier methods to perform retrieval. 
If the collection is large, it does not have to be very 
large to tie up all the computer’s memory capacity. It 
is clear that special computer-like devices will be called 
for if we are to perform efficient large-scale information 
retrieval. 

Although we have been trying to build high-speed se- 
lecting machines for information retrieval over the past 
twenty years, since the date of Bush’s machine, at the 
present time I do not think that it can honestly be said 
that we have done too well. We do not really have a ma- 
chine which is an altogether happy answer to the prob- 
lems of search and selection on collections ranging in 
size upwards from fifty or one hundred thousand items. 
The problem becomes even more unmanageable at the 
million point, since this size of collection requires rea- 
sonably high-speed processing and decision on a scanned 
record of something like 10 9 bits. 

However, the hardware will be built — and is being 
built. But what about the classification terminology, 
the subject headings, the descriptors, and the like? One 
after another, various machine projects have foundered 
on this problem, especially those projects that have 
copied library classification decimal systems or made 
use in a detailed way of their indexing techniques. We 
should appreciate that new mechanisms deserve new 
methods, and that there is a consensus of opinion (al- 
though it is not unanimous) that the method of putting 
together independent idea-expressing terms and select- 
ing upon their correlative occurrence constitutes the de- 
sired point of departure from the historic methods of the 
library. 

A highly developed form of this point of view is the 
method of “descriptors,” which was introduced and de- 

8 A. W. Tyler, W. L. Myers, and J. W. Knipers, “The application 

of Kodak Minicard system to problems of documentation,” Amer. 

Documentation, vol. 6, pp. 18-30; January, 1955. 


veloped in theory in 1948-1950 in a number of papers in 
conjunction with a mechanical card selector. 9 The de- 
scriptor method, which makes a great point of employ- 
ing precisely defined terms composing a limited vocabu- 
lary, is a refinement of a number of earlier practices. 
The method was implicit in the work of Soper, it was 
toyed with and dropped by the librarian Bliss, and it 
was used in one fashion or another by a number of scien- 
tists and chemists with Perkins cards in the 1940’s, e.g., 
by Bailey, Casey, and Cox. 10 People seem to confuse 
descriptors with Uniterms. The latter might be de- 
scribed as a crude form of a descriptor system, originally 
making use of words lifted from titles and texts. The 
Uniterm approach, since it was introduced in 1951, 
seems however to be migrating both in concept and us- 
age towards the descriptor methods, as is clear from 
many reports coming from projects where they claim to 
use Uniterms. 

The problem of classification terminology or language 
symbols for machine retrieval is well toward a solution, 
even for complex and structured kinds of information. 
An example is the work on the coding of chemical struc- 
tures for machine retrieval. 11,12 However, it should be 
noted that considerable work on retrieval of structured 
information, especially for chemical compounds, has 
sometimes resulted in symbolism that is not completely 
suitable for machine use, as for example some of the 
methods considered by the International Union of Pure 
and Applied Chemistry. 

The Present State of Affairs 

Although we may soon have suitable machines for 
large-scale information retrieval and although the situa- 
tion with respect to the language symbols of retrieval is 
in a reasonably satisfactory state (that is, ahead of the 
machines) we are not yet finished with our problems. 

Presuming that we have a machine completely ca- 
pable of dealing with a collection of one million— or even 
a hundred million — items, who will read these items or 
documents and assign the descriptors? Experience has 
shown that this is a difficult and time-consuming job. 
For example, in my experience in reading and coding 
patents, it takes me about fifteen minutes of reading, on 
the average, merely to figure out what the inventor is 
driving at. The Patent Office has some three million of 
such patents. 

This is exactly the kind of burdensome job that 
should be turned over to the machine. In fact this prob- 
lem is under active consideration and study in a number 

9 C. N. Mooers, “Zatocoding and developments in information 
retrieval,” ASLIB Proc., vol. 8, pp. 3-22; February 1956. This paper 
summarizes these developments. 

10 C. F. Bailey, R. S. Casey, and G. J. Cox, “Punched-cards tech- 
niques and applications,” J. Chem. Ed., vol. 23, pp. 495-499; 1946. 

11 C. N. Mooers, “Ciphering Chemical Formulas — The Zatopleg 
System,” Zator Co., Cambridge, Mass., Tech. Bull. No. 59; 1951. 

12 L. C. Ray and R. A. Kirsch,' “Finding chemical records by 
digital computers,” Science, vol. 126, pp. 814-819; October 25, 1957. 
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of places. It is not an easy task to give to a machine. It 
contains a great many aspects that would seem to re- 
quire the exercise of real “intelligence.” Fortunately, 
we already have one remarkable accomplishment which 
shows that this seemingly intellectual job is not com- 
pletely incompatible with mechanization. I speak of the 
work by Luhn on his “auto-abstractor .” 13 By the method 
of Luhn, the computer takes in the text of an article, 
statistically picks out the unusual words, and then 
chooses sentences containing these words to make up 
the auto-abstract. If this process were terminated at the 
point of picking out the words, we would have Uni- 
terms. If the words picked out in this fashion could be 
replaced by standardized words having approximately 
the same meaning, that is, if the synonyms could be 
eliminated, then we would have descriptors. It should 
be noted that this kind of treatment of synonyms, which 
has been going on in retrieval for some years, has lately 
been given the fashionable name of “the thesaurus 
method.” In the interests of precision in terminology, I 
should like to point out that there are significant differ- 
ences in Roget’s concept of a thesaurus and the set of 
equivalence classes of terminology that are required 
for retrieval. Indeed, this is precisely why I introduced 
the new terminology “descriptor” some years ago, that 
is, to give a verbal handle for a group of new conceptual 
methods with language symbols. 

Such a take-off on Luhn’s method would not be the 
final answer, because as Luhn has set it up, the machine 
is operating in an essentially brainless fashion. To do 
better than merely picking up words on a statistical 
basis, we would have to build into the method the capa- 
bility of handling the equivalence classes of words and 
phrases. This gets us into language translation. After 
the statistical approach has segregated words of high 
import from the text, we need to translate these words 
into the standardized descriptor terminology for further 
retrieval. However, even building up the equivalence 
classes of the terminology is a burdensome job, and this 
too should be turned over to the machine. Not only 
should the machine build up these equivalence classes, 
but it should be made to refine its performance with re- 
spect to using these terms and getting the descriptors, 
and it should even be made to learn how to improve its 
performance. 

Fano and others have suggested the use of statistics 
on the way people come in and use the library collection 
in order to provide feedback to help a machine improve 
its performance . 14 While the suggestion is in the right 
direction, I think that this kind of feedback would be a 
rather erratic source of information on equivalence 

13 H. P. Luhn, “The automatic creation of literature abstracts,” 
IBM J. Res. Dev., vol. 2, p. 159; April, 1958. 

14 R. M. Fano, “Information theory and the retrieval of recorded 
information,” in “Documentation in Action,” J. H. Shera, A. Kent, 
and J. W. Perry, eds., Reinhold Publishing Corp., New York, N. Y., 
ch. 14-C, p. 241; 1956. 


classes, because people might well borrow books by Jack 
London and Albert Einstein at the same time. While this 
difficulty can be overcome, there is a more severe prob- 
lem. Any computation of the number of people entering 
a library and the books borrowed per day, compared 
with the size of the collection, shows, I think, that the 
rate of accumulation of such feedback information 
would be all too slow for the library machine to catch up 
to and get ahead of an expanding technology. 

In this respect, it is my speculation that a more pow- 
erful source of educational material for a machine is al- 
ready available, and it should be tapped. Despite the ad- 
mitted limitations of such material, the subject entries, 
the decimal classification entries, and the other content 
typed on catalog cards contains a great deal of ready in- 
formation that can be used in teaching a machine how 
to assign descriptors to documents. Other collections, 
besides those in the libraries, also often provide a ready 
source of classificatory information that should be 
tapped. For instance, in the Patent Office, in each case 
record of each application for patent, there is a great 
amount of specific reference to other related patents, 
and this information, along with the assigned class num- 
bers, is readily available for machine digestion without 
further high-level human intellectual effort. 

In order to do these things, we shall need a machine 
with some rudimentary kind of “intelligence;” or more 
accurately, we shall need an “inductive inference ma- 
chine” in the sense used by Solomonoff . 15 An inductive 
inference machine is one that can be shown a series of 
correctly worked out examples of problems, that can 
learn from these problems, and that can then go ahead 
on its own (probably with some supervision and correc- 
tive intervention) to solve other problems in the same 
class. While an inductive inference machine can be quite 
capable at a given class of jobs, it need not have “brains” 
or “intelligence” in the general sense. 

As I mentioned before, putting the descriptors on the 
documents — that is, delineating the information in the 
text by symbols for retrieval — is a form of crude lan- 
guage translation. It is crude because the machine does 
not need to worry about grammar in the target lan- 
guage, since the grammar of descriptors is nonexistent, 
or at most, is rudimentary. As I see it, machine transla- 
tions of this kind for the purposes of information re- 
trieval will be an area of early pay-off for work in in- 
ductive inference machines. 

If inductive inference machines can be built at all, 
then it certainly should be possible for us to feed them 
with subject headings and classification numbers on the 
one hand, and with the titles of book chapters and sec- 
tion headings on the other hand, in order to teach the 
machines how to do at least some rudimentary kind of 
job of library subject cataloging. With librarians at 

15 R. J. Solomonoff, “An inductive inference machine,” 1957 IRE 
National Convention Record, pt. 2, pp. 56-62. 
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hand to provide suitable intervention or feedback, the 
machine’s performance should improve, and by further 
work, even the categories or descriptors which are used 
can be improved by the machine. Thus I feel that we 
can expect in the next few years to see some interesting 
results along this line. 

Goals for the Near Future 

There are a number of other applications of machines 
for purposes of information retrieval of a kind that have 
not yet been seriously undertaken, and others that have 
not yet been considered. In my discussion I shall bypass 
treating such useful and imminent tasks as the use of 
machines to store, transfer, and emit texts, so that at the 
time that you need to refer to a paper, even in an ob- 
scure journal, you can have a copy in hand within, say, 
twenty-four hours. Neither shall I consider the applica- 
tion of machines to the rationalization and automation 
of library ordering, receiving, listing, warehousing, and 
providing of documents. Neither shall I consider the 
application of machines to the integration of national 
and international library systems so that at any first- 
rate library, you will have at your command the cata- 
logs of the major collections of the world. These are all 
coming— but it should be noted with respect to them 
that the problems of human cooperation ranging from 
person-to-person to nation-to-nation cooperation are 
more serious than some of the machine and technical 
problems involved. 

The first of the rather unusual applications of ma- 
chines to information retrieval that I want to talk about 
can be introduced as follows. When a customer comes to 
an information retrieval system, he comes in a state of 
ignorance. After all, he needs information. Thus, his 
problem of knowing how to specify pieces of information 
that are unknown to him is a severe one. For one thing, 
the vocabulary of the retrieval system, and the usages 
of the terms in the system, may be slightly different 
from the language that he is used to. For another thing, 
upon seeing some of the information emitted according 
to his own retrieval prescription, he may decide that an 
entirely different prescription should be used. In short, 
the customer definitely needs help in using a machine 
information retrieval system, and this help should be 
provided by the machine. 

An indication of what kind of system needs to be pro- 
vided, and how it can be done, is given by certain of the 
simple sorted-card retrieval systems. Some of the sorted- 
card systems do very well in this respect, others do not. 
It has been common practice in Zatocoding systems, 
which use a simple schedule of a few hundred descrip- 
tors, to employ a descriptor dictionary system having 
many cross-references from words in the ordinary tech- 
nical usage to the appropriate descriptors . 9 Thus the 
customer can find his way into the system starting out 
with his own terminology. After the customer is referred 
to a descriptor, he finds there a carefully drafted scope 
note explaining the range of meaning attached to the 


particular descriptor. In another tabulation in the 
descriptor dictionary, the descriptors themselves are 
grouped or categorized into fifteen or twenty groups, 
and each group is headed by a question pertinent to the 
descriptors under it. Thus, under a question “Are geo- 
metrical shapes involved?” would be found descriptors 
such as “round,” “square,” “spherical,” etc. 

These simple card systems provide another source of 
assistance to the customer because they are able to emit 
cards within a minute or less from the time the retrieval 
search is begun. Thus if the search is headed into the 
wrong direction, the customer, upon looking at the 
cards or documents, will immediately detect this fact, 
and can reframe his request to the machine before any 
further searching is done. It is deplorable, but true, that 
many contemporary proposals for machine systems may 
be so slow in providing feedback that the feedback time 
is measured in hours*' or days, with the consequent waste 
of machine sorting time and accumulation of human 
frustration. 

The problems of customer assistance are going to be 
severe with the large machine retrieval systems of the 
future, and these problems must be faced. The de- 
scriptor vocabularies are going to be large. Another pos- 
sibility is that some of the machines will operate inter- 
nally on vocabularies or machine code systems that are 
quite unacceptable for external communication to the 
human operators. There has already been some success- 
ful experimentation with symbol systems of this kind in 
coding chemicals . 11,12 Such symbol systems work beauti- 
fully inside the machines, but people should not be 
forced to use them. For these reasons, in order to trans- 
late the customer’s requests into forms suitable for the 
machine, machine assistance is going to be desirable. 
Holt and Turanski see this problem of processing the 
customer’s request at the input to the machine as being 
very similar to the presently developing customer use of 
automatic programming for mathematical problems. 
The more advanced systems of automatic programming 
provide for a succession of stages of translation, with the 
symbolism at each stage moving further and further 
from the human word input to the abstract symbols and 
the detailed machine orders required for internal opera- 
tion of the machine. In mathematical programming, the 
machine programs itself, and then carries out the pro- 
gram. In retrieval programming, the machine will form 
the proper machine prescription, and carry out the 
search. To my mind, there is an important difference. 
In retrieval, the machine should check back with the 
customer as it builds up the prescription in order to 
make sure that the search will be headed in the right 
direction; then it should search a sample of the collec- 
tion and check again to make sure that the output being 
found is appropriate to the customer’s needs. If we are 
to have larger and more complex machine retrieval sys- 
tems, we must come to expect a great deal of back-and- 
forth man-machine communication during the formula- 
tion of a search, and as it is going on. 
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Quite another approach to handling the customer’s 
input problem is advanced by Luhn, who suggests that 
the customer write a short essay detailing what he 
thinks is descriptive of the information he wants . 16 The 
essay text would then presumably be handled in the 
fashion of the auto-abstract method (though Luhn is a 
little sketchy here on the details of his proposal), and 
the words selected from the short essay would be com- 
pared with words similarly selected from the document 
texts. When there is a sufficient degree of similarity, 
selection occurs. Although the Luhn proposal does put 
the load of translation of the customer’s request upon 
the machine, it does not provide for customer guidance 
into the resources of the machine’s selective language 
possibilities, or into the resources of the collection. Help 
in both of these directions would surely be of great as- 
sistance to a customer in extracting the maximum value 
from information in storage. 

Another possibility is to use an inductive inference 
machine, because it is open to learning a great variety 
of tasks. It would be able to provide a generalized ap- 
proach to the problem of customer assistance. But, how- 
ever customer assistance is provided, I think it is safe 
to predict that we must build information retrieval sys- 
tems with the planned capability to communicate back 
and forth with the customer so that he can better guide 
the machine in retrieving what will be useful to him. 

Retrieval Viewed as a Process of Education 

If the machine aids the customer by guiding him in 
the use of the retrieval system, the machine is neces- 
sarily educating the customer. Let us take this view- 
point, and look upon a machine retrieval system as an 
educational tool. This viewpoint provides a number of 
new tangents to consider. We have seen how the 
customer can use some coaching by the machine in order 
to tap efficiently the information resources during the 
search process. But, as anyone knows who has had a 
large batch of documents sent his way, maybe the 
customer can also use some machine help in reading the 
mass of documents emitted from a retrieval system! 

It is my prediction that some of the machine informa- 
tion retrieval systems of the future will go considerably 
beyond the tasks of mere retrieval or citing or providing 
document texts. I believe that some of them will also 
help the customer assimilate or read the output provided 
by the machine. This prediction is not at all fanciful, 
even though it is yet quite a way into the future. How 
far into the future it is we can only guess, or estimate by 
recalling that the Minicard follows a full twenty years 
after the first suggestions for a film selector, or that the 
widespread use of descriptors came about forty years 
after Taylor actually used something very much like 
them in information selection.' 

16 H. P. Luhn, “A statistical approach to mechanized encoding 
and searching of literary information,” IBM J. Res. Dev., vol. 1, 
p. 309; October, 1957. 


Machines can be very effective in teaching human be- 
ings. This is shown by the work of Skinner at Harvard 
where, in recent experiments, written modern languages 
and college mathematics have been set up on machine 
lessons . 17 Essential to the process is rapid feedback, or 
communication between the machine and the human 
learner, so that the human knows immediately that he 
is on the right track, and so the machine can apply cor- 
rective action as soon as errors appear. Skinner’s ma- 
chines at present employ written materials prepared in 
advance by human beings, the machine performing on 
the basis of a fixed internal sequence of morsels of in- 
formation of graded difficulty. However, machines need 
not be restricted to doing their teaching according to a 
preset sequence of lesson elements of this kind. In the 
same way that we are currently looking for techniques 
to allow machines to assign descriptors from texts, so 
can we contemplate the development of teaching pro- 
cedures and machines whereby the machines by them- 
selves will be able to pick out a graded sequence of in- 
formation morsels from the documentary record re- 
trieved and will then present them to the human 
customer. 

Taking this view of a machine information center act- 
ing both as a retrieval device operating upon a store of 
information and as a teaching device for the human 
customer, we can see that the process of input request 
formulation and the process of giving out information 
will merge into a sustained communication back and 
forth between the customer and the machine. Of course, 
once the customer is on the track of documents contain- 
ing information particularly pertinent to his interests, 
he will very likely desire to see the original text. This 
can be done, and a customer will have a choice of how 
much or how little of any particular actual document he 
wishes to read directly. 

The range of future possibilities is even greater when 
these ideas are combined with the possibilities inherent 
in mechanical language translation devices. Of course, 
we should expect that future information centers will be 
able to provide translation from one ethnic language to 
another of the texts that the retrieval system provides. 
Let us look further. As is well known, one of the prob- 
lems in machine language translation is to provide 
sentences in the target language in the required form— 
that is, to provide a smoothly running, colloquial trans- 
lation. For example, in going from German to English, 
we must rescue the verbs from the end of the German 
sentence and put them up where they belong in the mid- 
dle of the English sentence. Any machine capable of 
doing a high-grade language translation must be able to 
arrange and rearrange idea units and word units to make 
acceptable text out of them. This being the case, it is 
reasonable to predict that the information morsels that 
a teaching machine would put out could be given as in- 

17 B. F. Skinner, “Teaching machines,” Science, vol. 128, pp. 969- 
977; October 24, 1958. 
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put to a machine technique patterned on the output 
half of a language translator. The resulting textual out- 
put would be in the nature of a written article having at 
least some degree of acceptable style. 

This means that you could go to an information center, 
describe a certain kind of information needed, have 
the machine assist you in making your request more 
definite, and then order it: “Provide me with an 800- 
word article, not requiring more than an undergraduate 
chemistry background, on the deterioration of poly- 
isomers by sunlight.” After a short, but decent, interval, 
the machine would come forth with such an essay. 

There is an important corollary to this notion of the 
machine central being able to provide essay articles 
upon request. We are all aware of the considerable dupli- 
cation of information found in the technical literature. 
The same, or very similar, piece of information is re- 
peated in one article after another. Many articles are 
summaries of other articles, or are derivative upon 
other articles, and provide little or nothing that is new. 
If the output from an information system does not need 
to be in the form of a graphic image of the original text, 
or a type-out of the text itself, then it is possible to con- 
sider the storage of new information only in the ma- 
chine. A machine could store facts alone, and only new 
facts; it would not store text. By eliminating the de- 
pendence upon the original text, and avoiding the du- 
plication of the same information written over and over, 
it might be possible to secure considerable increase in 
the machine’s storage capabilities. 

Yet there are problems of a kind that will occur to 
any thoughtful individual. I do not think we want to 
throw away the original record which we have already — 
the printed books and articles in our libraries. Neither 
am I sure that we want to give up entirely our system of 
printed publication. But, putting these problems aside, 
let us do some more speculating. It might be possible for 
the scientist in his laboratory to feed his raw (or nearly 
raw) results directly into a machine for computation, 
checking for acceptability, correlation with earlier facts, 
and ultimate storage. Thus, instead of a scientific 
archive existing almost solely on paper, as we now have, 
it is possible that a part of the archive in the future will 
be in machine form. The only way that such a machine 
archive would be tapped would be by having the ma- 
chine write a summary or article upon specific request. 

When we are thinking about information machines of 
this kind, I wish to stress that we should not think in 
terms of some big single machine central. This is im- 
portant. It would be foolish and expensive to build up 


a single central “bottleneck.” If such machine central 
information systems as I describe will be at all possible, 
they will be important enough to be set up at a large 
number of installations, quite in the same way as we 
now are making use of a large number of electronic com- 
puter installations. There will be both large and small 
information machines. Some of these machines will be in 
intercommunication with each other, while others will 
operate in isolation. At various times, the machine 
memory from one or several of the machines can be 
played out onto tape, and the tape record, containing a 
vast amount of information, can be incorporated into 
the memory systems of many other information centrals. 

If machines can store and correlate laboratory facts, 
and can communicate with laboratory workers, we shall 
have to expect that the machines will find gaps in the 
information as a part of the correlation, and they will 
point out to the laboratory workers the need for further 
experimentation in certain areas. How far we can ex- 
pect this kind of active feedback to extend is hard to 
guess. The present work with pattern recognition will 
ultimately lead to a kind of a machine eye, and we al- 
ready have machine hands for the handling of radioac- 
tive materials. An information central machine system, 
aided by such receptors and effectors, would become, in 
effect, a laboratory scientist. 

At this point I would prefer to terminate my specula- 
tions on the excuse that we are now perhaps more than 
twenty years into the future, the limit that I set for my- 
self in this paper. 

In summary, I think that it can be said that mechani- 
cal information retrieval has started rather slowly; it 
has taken from about 1915 or 1920 until now to become 
as popular as it is. At the moment, except for certain 
highly integrated small retrieval systems, we are yet 
only dabbling in the subject. We do not now honestly 
have any appropriate large-scale machine for collections 
involving millions of items. We are only beginning to 
get a widespread recognition of the capabilities of suita- 
ble retrieval language systems, and there still remains 
the problem of getting machines with internal digital 
operations that are as suitable for retrieval and informa- 
tion work as the operations of addition and multiplica- 
tion are suitable for mathematical work. 

In any event, it is useful for us to know what some of 
our future targets are likely to be. With such knowledge, 
we will be in a better position to steer our activities in 
the present. This is the excuse for the predictions — 
which I take very seriously — that are contained in this 
paper. 
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Simulation of an Information Channel on 
the IBM 704 Computer* 


E. G. NEWMANt ahd L. 0. NIPPEf 


Introduction 


E XPRESSIONS for the probabilities of multiple 
, error patterns in symbols consisting of N bits are 
needed to evaluate the effectiveness of applying 
error correcting codes to these information channels. 
For example, if a Hamming 1 single error correcting code 
is applied to the information, the probabilities of all 
errors greater than one should be established. This will 
indicate the number of times the single error correcting 
code has failed. 

In addition, if the effects of channel asymmetry and 
regional error dependence on the probabilities of multi- 
ple errors are known, this information can be used to 
develop channel characteristics leading to the minimum 
number of errors. 

Expressions for error probabilities have been obtained 
for a binary symmetric channel 2 and for an asymmetric 
channel [see (2) or (6) in the Appendix] when errors are 
assumed to be independent. Attempts to introduce error 
dependence were made by using relationships between 
stochastic processes and familiar network concepts. 3 ’ 4 
All results, however, for even relatively simple cases 
became too unwieldy to be useful. Therefore, a simula- 
tion program which gives approximate values for the 
desired error probabilities was developed. 

Many practical information channels exist which ex- 
hibit both error dependence and asymmetry. An exam- 
ple of such a channel is a magnetic tape channel. A sym- 
bol of N bits can be recorded on N parallel tracks (Fig. 
1). The error dependence is primarily the result of de- 
fects extending over certain regions of the tape which 
influence the correct transmission of successive bits in a 
particular track or in a number of adjacent tracks. 

A single defect may produce errors on a single track 
only (Defect I, Fig. 1), or larger defects may cause errors 
in a number of adjacent tracks (Defect II, Fig. 1). These 
large defects produce multiple errors for all consecutive 
iV-bit symbols which fall into this defective region. 
Multiple errors can also be produced by the simultane- 
ous occurrence of different defects, each affecting the 
information in a particular bit of an 77-bit symbol 
(Defects I and III, Fig. 1). 


* This work has been supported by the U. S. Dept, of Defense, 
f IBM Product Dev. Lab., Poughkeepsie, N. Y. 

1 R. W. Hamming, “Error detecting and error correcting codes,” 
Bell Sys. Tech. J., vol. 29, pp. 147-160; April, 1950. 

2 L. Brillouin, “Science and Information Theory,” Academic 
Press, Inc., New York, N. Y., ch. 6, pp. 62-70; 1956. 

3 W. H. Huggins, “Signal flow graphs and random signals,” Proc. 
IRE, vol. 45, pp. 74-86; January, 1957. 

4 S. J. Mason, “Feedback theory — some properties of signal flow 
graphs,” Proc. IRE, vol. 41, pp. 1144-1156; September, 1953. 



Fig. 1 — Effects of regional defects on the generation of 
multiple errors in an iV-bit symbol. 


Depending on the type of recording used, certain 
types of defects may cause errors in the transmission of 
ones without affecting the transmission of zeros. Other 
types of defects or noise bursts affect the transmission 
of zeros only. Some defects can produce errors in both 
ones and zeros. These phenomena lead to channel asym- 
metry. The degree of this asymmetry is governed by the 
distribution of the various defect types. 

For example, for a modified nonreturn to zero type 
of recording (Fig. 2), a one is recorded by changing the 
magnetization of the tape from one saturation level to 
the other. A zero leaves the magnetic tape at a previ- 
ously established saturation level. As the read head 
senses only a change in the flux linking it, only ones 
produce signals. 

Tape defects in the form of high spots in the surface 
of the magnetic coating or loose particles of oxide ma- 
terial produce a head-to-tape separation, which results 
in a loss of signal. This affects the transmission of ones 
only. The characteristic loss in signal amplitude is 
shown in Fig. 2. This phenomenon is not unlike the 
amplitude “fading” of radio signals and may, at high 
recording densities, produce a number of consecutive 
errors. 

Other tape defects, such as pin holes in the magnetic 
coating, can produce errors in the transmission of zeros 
(Fig. 3). At the boundaries of the hole in the magnetic 
coating, a flux change will be sensed by the read head 
as it moves from a region free of magnetic particles to 
one where magnetized magnetic particles exist. This 
change of flux could be sensed as a one in place of a zero. 
In addition, amplifier noise can occasionally exceed a 
certain clipping level. If this noise peak occurs at the 
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Fig. 2 — Effect of tape defect causing head-to-tape separation on 
signal amplitude (modified nonreturn to zero recording). 


Fig. 3 — Effect of defect in magnetic oxide (pin hole) on signal 
amplitude (modified nonreturn to zero recording). 


time when a zero should be read, it may be sensed in- 
correctly as a one. 

If the hole is large, ones recorded in this region will, 
of course, not be sensed. A hole in the magnetic oxide or 
a nonmagnetic particle in the magnetic coating can, 
therefore, produce errors in both ones and zeros. 

These are some typical examples of the error- 
producing mechanisms which cause the complex char- 
acteristics of the magnetic tape channel. 

Channel Simulation 

Because of the difficulty of obtaining analytic expres- 
sions for the probabilities of multiple error patterns in 
terms of the complex characteristics of a tape channel, 
simulation means had to be developed. 

The simulation procedure can best be described by 
comparing it to the manufacturing of tape. The defects 


single track or at the most a few adjacent tracks. A 
theoretical description of the channel characteristics 
can also be used. The program is completely flexible, 
that is, tape of any specifications can be “manufac- 
tured.” 

Random information could now be “recorded” over 
the whole length of the hypothetical tape and errors in- 
troduced in accordance with the defect type. The pro- 
gram, however, concerns itself only with errors. Thus, 
information is placed only into the defective regions and 
the resulting errors are classified and counted. This re- 
sults in a considerable saving of computer time. 

The 704 Program 

The 704 Information Channel Simulator Program 
(Fig. 4) might best be considered by breaking the pro- 
gram into three major phases as indicated below: 


Phase I 


Phase II 


Phase III 

Location Generator 


Sort 


Error Analysis 

Random Generator for 


Sort by Defect 


Classification and 

Defect Location 


Location 


Count of Errors 


in the tape, produced during the manufacturing process, 
can be considered to be distributed in a random manner 
over the length of the tape. The program simulates this 
process by assigning random locations to each of the 
defect regions, as they are read from a list of inputs. 
This list gives the number and class of all the various 
defects which must be placed on a particular length of 
tape. This defect listing can usually be prepared on the 
basis of error statistics obtained from tests involving a 


Phase I might be called the “tape manufacturing” phase 
because it is responsible for the actual placement of de- 
fects on the hypothetical tape. All defects are assumed 
to be placed on the tape simultaneously and previous 
to the analysis phase. 

Since the defect locations are generated randomly, it 
is next necessary to sort (Phase II) the defects by loca- 
tion so that they may later be processed in sequence. 

Phase III performs the random generation of infor- 
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Fig. 4 — General flow diagram. 

mation to be written on the tape, the analysis of the in- 
fluences of each defect on this information, and the 
classification and counting of resulting errors. 

Input Data 

Before discussing any of the three phases of the pro- 
gram in detail, it is necessary to have an accurate pic- 
ture of the input data, which are prepared on the basis 
of actual test data or from a theoretical description of 
the channel characteristics. 

To formulate the input data for the program the fol- 
lowing statistical information is needed : 

1) The a priori probabilities that defects of a particu- 
lar length will occur on a track. These defects can 
produce errors in ones or zeros or both on this 
track only. 

2) The a priori probabilities that “centers” of large 
defects of a particular size will occur on a particu- 
lar track. These defects can produce errors in ones 
or zeros or both in a region which encompasses a 
number of adjacent tracks. 

For program use, the above information is converted 
into a set of values K, L, T, and C, where 

K = the number of defects per track of a certain type 
to be placed on the hypothetical tape, 

L = the length of the defect in bits, 

T — the track number of the defect center, and 

C = the defect class, and indicates whether the defect 
involves errors in ones, zeros, or both, and 


whether one, two, or three adjacent tracks are 
involved. For the defects which produce errors in 
more than one track, the same defect length per 
track was assumed. 

The above set of values is defined as an input data set. 
A different set may be used for each track. 

Phase I — Location Generator 

Each input data set read per track represents K de- 
fect areas. In order to assign a random longitudinal loca- 
tion X to each, a random number modulo M is gen- 
erated. The pseudo random number generator program 
used here is PE RAND. It produces a 35-bit random 
number by multiplying two odd, 35-bit numbers, 
selected from a group of ten such numbers, to produce 
a 70-bit product. The center 35 bits of this product are 
used as the random result and may be divided by a 
previously specified number, M, to make the result 
modulo M. This generator has been thoroughly tested. 
The probability that any bit of the resulting random 
number is a one lies between 0.45 and 0.55. This was 
considered entirely satisfactory for this application. 

This defect location just generated is stored along 
with the appropriate length, track, and class. Any given 
defect may involve one or more adjacent tracks as spec- 
ified by its class. Arbitrarily, the program was written 
to handle defects extending over no more than three 
adjacent tracks. The program converts these adjacent 
track defects to individual track defects of equal length 
and assigns the same location to each. 

Phase II — Sort 

After a random location has been generated for each 
defect, these defects are sorted by location fields. There 
are two general sort programs available for the 704 
through the SHARE organization. Either may be used, 
or the sorting may be done on another machine. 

Phase III — Error Analysis 

It can be seen from the flow diagram of Fig. 4 that 
there are three basic operations other than decision 
making that must be performed by this phase. They are 

1) Comparison of defect locations, 

2) Random generation of information in the form of 
iV-bit symbols, and 

3) Classification and counting of errors within the 
affected A-bit symbols. 

As each defect is read from the sorted list, its location 
must be compared with the location of other defects to 
see if any of the defects overlap. When defects on dif- 
ferent tracks overlap, multiple errors may result. When 
defects on the same track overlap, they are joined 
into one defect. 

When the number of overlapping defects changes, a 
new “section” is formed (Fig. 1). The program keeps 
track of the number of overlaps involved at any instant. 
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The number of bits involved in these overlapping 
defects represents the maximum number of errors which 
can occur in the symbols in the overlap. For example, 
consider Section 2 in Fig. 1. This particular section is 
two bits long and involves Defects I and II. Thus, no 
more than four bits may be in error in this particular 
overlap, two bits in each of two 2V-bit symbols. The 
program provides three counters for double, triple, and 
higher order overlaps. 

Next, the program must generate random informa- 
tion for each iV-bit symbol in the overlap. Considering 
each bit individually, it will be in error if any one of the 
following three conditions exists: 

1) The bit is a one and the defect produces errors in 
ones, 

2) The bit is a zero and the defect produces errors in 
zeros, 

3) The defect produces errors in ones and zeros. 

A test for errors is now made on the basis of these three 
conditions. The previously described PE RAND pro- 
gram is used to generate the random information. 

Each bit in the affected 2V-bit symbol is tested, and, 
if an error has occurred, the appropriate error counter 
is updated. The program provides counters for double 
and triple errors 

Machine Time on the IBM 704 Computer 

Table I indicates approximate machine times per 
phase for typical runs involving 3500 and 10,000 defect 
regions, respectively. It shows that the time required to 
run Phases I and III is linearly related to the number 
of defect regions involved, while the time for Phase II 
(Sort) increases more rapidly with the number of de- 
fects. This indicates that a larger number of runs involv- 
ing fewer defects will give shorter over-all machine 
times. 


TABLE I 

704 Machine Time in Minutes per Run 


Number of Defect 
Regions 

Phase I 

Phase II 

Phase III 

Total 

3500 

• 3 

4 

4 

11 

10,000 

9 

22 

12 

43 


Results of Simulation Program 

The program was tested by using it to simulate a sym- 
metric binary channel with no error dependence. For 
this channel, exact error probabilities can be computed 
[see (1) in the Appendix], Results of these computations 
were compared with results of the simulation program. 
A few of these results are shown in Fig. 5 for N equal 
to 24, 12, and 8. This channel had an a priori error prob- 
ability per track of £ = 2X10~ 3 . The average error for 
the double error probability, as compared with the 
theoretical results, varied in these examples from 0.76 
to 1.77 per cent. 


TOTAL NUMBER OF DOUBLE 
ERRORS FOR 3 RUNS. 1673 


AVERAGE ERROR = I 01% 



Fig. 5 — Error in per cent for double error probability. Symmetric 
channel (defect length = 1); a priori error probability, p = 2X10 -3 . 


As may be expected, the accuracy of the results de- 
pends on how many runs have been made. The number 
of runs required will be different for each problem and 
will depend on how long it takes to accumulate a sig- 
nificant count for the various error patterns of interest. 
The theoretical limit to the final accuracy of the results 
of the simulation program is set by the inherent in- 
accuracy of the pseudo random number generator and 
on the accuracy with which the mathematical model 
describes the actual information channel. 

The simulation program has been designed for infor- 
mation channels with rather complex characteristics. 
An example of such a channel is a Z channel. In a Z 
channel, one of the binary symbols is always trans- 
mitted correctly; thus, it is a channel with the greatest 
possible degree of asymmetry. For the example selected, 
the probabilities of occurrence of defects of various 
lengths are shown in Fig. 6. This hypothetical distribu- 
tion was developed to study the effect of channel asym- 
metry and error dependence on error probabilities. In 
this distribution, the defects contain as many single bits 
per track as in the previous example for the symmetric 
channel. 

For this Z channel, the approximate double error 
probabilities for N = 24, 12, and 8 are shown in Fig. 7. 
The results for this example indicate that good approxi- 
mations for multiple error probabilities can be obtained 
by assuming that each long defect region is split up into 
as many separate regions one bit long, as there are bits 
in the original defect region. 

The multiple error probabilities for this Z channel 
(pi = 2X10~ 3 ) can be computed, using (3) or (8) in the 
Appendix. 
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DEFECT LENGTH IN BITS 

Fig. 6 — Hypothetical probability of occurrence of 
defects of various lengths. 

Summary 

The results of the simulation program indicate that 
good approximations to the probabilities of multiple- 
error patterns in symbols consisting of N bits can be 
obtained. 

The computer time required for the simulation is 
quite reasonable and compares favorably with the costs 
for the testing of numerous complete versions of a sys- 
tem. The simulation procedure is, therefore, particu- 
larly useful during the early stages of development. 

The program, because of its flexibility, also lends it- 
self to purely theoretical investigations of error 
statistics. 



Fig. 7 — Double error probability for a Z channel for 
error dependence shown in Fig. 5. 



RECEIVED 

SYMBOL 


Fig. 8. 


Appendix 


Expressions for Error Probabilities for an 
Asymmetric Binary Channel with 
N o Error Dependence 


Pn\z) 





( 1 ) 


The binary asymmetric channel can be represented 
by Fig. 8 where 

g 0 = the probability of a transmitted zero being re- 
ceived as a zero, g 0 = 1 — po, 
gi = the probability of a transmitted one being re- 
ceived as a one, qi = l—pi, 
po = the probability of a transmitted zero being re- 
ceived as a one, 

/>i = the probability of a transmitted one being re- 
ceived as a zero. 

Let there be N bits in a character, of which r are ones 
and (N — r) are zeros. The probability Pn(z) or z errors 
in a character consisting of N bits can be expressed for 
a symmetric channel 1 where p\ = po. 


For an asymmetric channel, if all iV-bit symbols are 
assumed to be transmitted with equal probability, 


b I St) [£{(t> «''™} 

• {(yEy) 




and, for a Z channel, po — 0 and q 0 = 1. 


1 (!=* /N\ / r\ 

P N "'(z) = -|Z (- ( — XpCqC 

2 N l r =z \ r / \ z / 
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where 



N\ 

xl(N — x ) ! 


(4) 


Simplifications can be made in (1)— (3), if systems of 
inherently high reliability are considered. That is, if 


and, if pi = apo, 


Pn"{*) = 



( 7 ) 


Pn'"(z) = 



( 8 ) 


i>i « 1, po«l. (5) 

Expanding (1 —p) k in terms of a power series and re- 
taining only the first term, (1)— (3) , respectively, become 

Pn'(z) = (~)Pi z ( 6 ) 
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A Compiler with an Analog-Oriented Input Language 

M. L. STEINf, J. ROSEJ, and D. B. PARKER! 


Introduction 

ANALOG computation is, for many problems, more 
/—\ convenient than digital computation but lacks 
the precision obtainable from a digital-computer 
solution. A compiler has been developed which, for 
problems involving differential equations expressible in 
the form 

Pi = f(yi, ?*,*••, Pn), i = 1 , 2, • • • , n (1) 

combines to a considerable extent the desirable attri- 
butes of both types of computation. The compiler 
achieves this by deducing from a description of an 
analog setup diagram the differential equations which 
the analog computer solves, and then compiling a pro- 
gram for solving the equations. 

Two drawbacks are avoided. The differential equa- 
tions represented by a diagram are deduced with no 
attempt to simulate the analog computer, and a sophis- 
ticated integration procedure is used. Therefore, an 
accurate and relatively efficient digital-computer pro- 
gram is obtained. Even though the solution is obtained 
from the differential equations, the identity of the out- 
put of each element on the diagram is not lost, so that 
the results may be visualized more easily as the re- 
sponse to the physical system being studied. 

The integration procedure used in the final program 
is the Gill 1 version of the fourth-order Runge-Kutta 

f University of Minnesota, Minneapolis, Minn. The work of this 
author was supported in part by Convair-Astronautics. 

t Convair (Astronautics) Div., General Dynamics Corp., San 
Diego, Calif. 

1 S. Gill, “A process for the step-by-step integration of differential 
equations in automatic digital computing machines, ” Proc. Cambridge 
Phil. Soc., vol. 47, pp. 96-108; June 3, 1950. 


method. Since no starting procedure is required, and 
because the discontinuities introduced by nonlinear 
analog elements cause no difficulty, this method was 
adopted. 

The compiler is not the first digital computer program 
with an input language related to differential analyzers; 
for example, there are the programs DIDAS 2 and 
DEPI. 3 However, the program which- this paper de- 
scribes differs from other similar programs of which the 
authors are aware in one or more of the following 
respects : 

1) The input language is closely related to an exten- 
sively used electronic analog computer. 

2) The user need not provide his own input and out- 
put program. The compiler provides a complete 
program ready to run. 

3) Since the final program produced is in machine 
language, it is efficient in terms of execution time 
as compared to an interpretive program. 

4) Rather than simulating a differential analyzer, the 
compiler deduces from a setup diagram the dif- 
ferential equations represented by the diagram. 

In producing a program to solve the differential equa- 
tions expressed by an analog setup diagram, the com- 
piler uses a technique of increasing popularity. 4 This 
technique is the use of another processor as an inter- 

2 G. R. Slayton, “DIDAS,” presented at the Twelfth Natl. Meet- 
ing of the Assoc, for Computing Machinery; June, 1958. 

3 F. H. Lesh and F. R. Curl, “DEPI, An Interpretative Digital- 
Computer Routine Simulating Differential-Analyzer Operations,” 
Jet Propulsion Lab., California Inst. Tech., Pasadena, Calif., Memo. 
No. 20-141; March 22, 1957. 

4 Communications of the Assoc, for Computing Machinery, vol. 1, 
no. 7, p. 5; July, 1958. 
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mediate step. In this case the intermediate processor is 
Fortran, 5 an automatic coding system developed by 
IBM which accepts statements closely resembling the 
ordinary language of mathematics as input. The output 
of the compiler is in the input language of Fortran, and 
is translated by Fortran into machine language. 
Through the use of Fortran, the task of developing the 
compiler was greatly simplified. 

The method used to deduce the differential equations 
represented by a setup diagram through analysis of its 
description is developed in a previous paper by two of 
the authors. 6 Therefore the method will not be devel- 
oped here. Instead it will be illustrated by example. A 
description of the preparation of problems for the com- 
piler and a description of the compiler will be given. 
The application of the compiler will be illustrated with 
the solution of a simple problem, and in conclusion, ex- 
perience in its use will be discussed. 

The Analog Setup Diagram Description 

The conversion of an analog setup diagram to a 
digital-computer program involves several steps. The 
diagram is described. The description is processed by 
the computer and a Fortran program is produced. The 
program is compiled by Fortran into a machine lan- 
guage program ready to be run. All of these steps except 
the first are performed by an IBM 704. This first step, 
the description of the diagram, will now be discussed. 

Analog Elements Recognized 

In order to analyze the description of an analog setup 
diagram, the compiler must be able to recognize the 
more commonly used analog elements. Those elements 
available on the Electronics Associates PACE elec- 
tronic analog computer were chosen as representative. 
These elements are listed in Table I. The diagram sym- 
bols for the elements are those most commonly used for 
PACE setup diagrams. The mathematical expression 
for the function of each element is chosen so that prob- 
lems prepared for PACE computers can be converted 
to digital programs with little or no change in the setup 
diagram. For this reason, the scale factors associated 
with multipliers and dividers are included, and resolvers 
involve angles measured in volts where one volt equals 
two degrees. 

Normally, scaling requirements for an analog com- 
puter are quite restrictive; but since the operations in 
the digital computer program are performed in floating- 
point form, all numbers, including parameters associ- 
ated with elements, can vary over a much wider range 
in the digital computer. Therefore, diagrams to be 
processed by the compiler need not be scaled for an 
analog computer. 

6 J. W. Backus, et at., Programmers Reference Manual for the 
Fortran Automatic Coding System for the IBM 704; December, 1957. 

6 M. L. Stein and J. Rose, “The Automatic Deduction of Differ- 
ential Equations from Analog Setup Diagrams,” Mathematical Pre- 
Print Series, Pre-Print No. 13, Convair-Astronautics, San Diego, 
Calif. 


In addition to the use of amplifiers as summers and 
integrators, special one-amplifier circuits representing 
more complicated transfer functions are sometimes used 
on electronic analog computers in order to save analog 
elements. Such use of amplifiers is not permitted on 
diagrams whose description is to be processed by the 
compiler. One-amplifier circuits generating special 
transfer functions must be replaced with equivalent 
circuits using elements in Table I. 

Two of the elements, element 13 and element 14, 
listed in Table I do not correspond to actual analog ele- 
ments, but have been included for convenience and for 
the compilation of more efficient digital programs. Ele- 
ment 13, the Fortran statement, is included to allow 
the replacement, if desired, of feedback loops generating 
functions such as x 2/3 , e x , etc., by a Fortran statement 
utilizing a library subroutine in order to obtain a more 
efficient digital program. Element 14, the external input, 
is included to allow the combination of compiler-gen- 
erated Fortran programs with other Fortran programs. 

The Preparation of a Diagram Description 

The compiler converts an analog-computer problem 
to a program by processing information supplied in a 
description of the analog setup diagram. Therefore, the 
preparation of an accurate diagram description by the 
user of the compiler is an essential step in the process of 
obtaining a correct digital program. 

Before beginning the description of a diagram, how- 
ever, it should be determined that the problem is of a 
type suitable for conversion. While the analog computer 
is capable of solving several types of problems, the com- 
piler is restricted to problems involving differential 
equations expressible in the form (1). That is, no im- 
plicit relationships are allowed among derivatives or 
among other variables. The compiler detects implicit 
relationships by discovering the presence on the diagram 
of feedback loops without integrators, a procedure 
whose suitability for detecting implicitness is demon- 
strated in a previous paper. 6 Therefore, the diagram 
should be examined for such loops before preparing the 
description. 

Having determined that the problem is of a suitable 
type, the diagram must be examined for elements other 
than those listed in Table I. Circuits involving other 
elements must be replaced by equivalent circuits using 
acceptable elements. Circuits for obtaining powers, 
roots, arctangents, exponentials, and natural logarithms 
may be replaced, if desired, by Fortran statement ele- 
ments. 

After it has been determined that the problem is of a 
suitable type and that all circuits are acceptable, the 
actual process of describing the diagram is clerical in 
nature. Consequently, the description may be done by 
an engineering aide, freeing the engineer who originated 
the problem for tasks making better use of his skills. 

The first step in preparing a diagram description is to 
assign to each element on the diagram an arbitrary 
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TABLE I 

Elements Recognized by the Compiler 
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number between 1 and 999. Using a suitable input form 
of the type shown in Fig. 1, the elements are described 
one at a time in any convenient order. An element’s 
number is entered on the form. Then its type is desig- 
nated as given in Table I — for example, SU for a sum- 
mer. If the element has parameters associated with it, 
the values of the parameters are entered on the form. 
If the output of the element is to be printed, an N for 
normal output or a C for checkout output is written on 
the form. The inputs to the element are then listed by 
entering the numbers of the elements from which the 
inputs come. If the element is an integrator or a sum- 
mer, the scale factors associated with the inputs are 
listed. As an example, Fig. 2 shows the complete entry 
for element 2, an integrator whose single input is equal 
to five times the first output of element 6, an electronic 
multiplier, and whose initial output is —45. 

Each element is described in a fashion similar to that 
above on one line of the input form. When all elements 
have been described and the descriptions checked, the 
form is given to a key-punch operator, who punches 
each line on a card. These cards become the input to the 
compiler. 

The Transformation Process 

In order to transform a description of an analog setup 
diagram into a digital-computer program, the compiler, 
using a procedure developed in a previous paper, 6 must 
deduce the differential equations which the diagram 
represents. Having deduced the equations, the compiler 
must then produce a Fortran program suitable for 
solving the equations. These two major phases of the 
transformation process will now be described and will 
be related to the complete compiler program. 

First, the differential equation of a simple diagram 
will be deduced. In Fig. 3 we see the diagram for the 
equation 

y = R - Ky. (2) 

PI represents the output of element 1, P2 the output of 
element 2, etc., and D2 represents the total input to 
element 2, an integrator. Examination of the types of 
elements 1 and 3 (see Table I) permits the equations 

PI = R (3) 

P3 = KP2 (4) 

to be written. Examination of element 2 discloses that 
it is an integrator. The equation for its input can be 
written as 

D2 = + PI + P3 (5) 

If (3) and (4) are substituted into (5), the equation 

D2 = R + KP2 (6) 

is obtained. Recalling that 





Fig. 2- — Input form entry for one element. 



Fig. 3 — Diagram for y — R—ky. 


as shown in Table I, (6) is seen to be equivalent to (2). 
Therefore, given values for C (the initial value of P2), 
R, and K, a solution for (2) can be obtained through use 
of a suitable numerical integration procedure. 

Actually, (3)-(5) need not be combined if they are 
evaluated in the proper sequence, since all that is re- 
quired to carry out the numerical integration is that 
it be possible to obtain a value for D2, given a value of 
R, K, and P2. If equations are not to be combined, de- 
ducing the differential equations from a diagram de- 
scription consists of determining the proper sequence 
for evaluating each element and then producing equa- 
tions for evaluating the output of each element with the 
exception of integrators, whose input must be evaluated 
as seen above. Leaving equations uncombined leads to a 
somewhat less efficient computer program. However, it 
has the advantage of preserving the identity of the out- 
put of every element, a useful property when programs 
are debugged, and also aids in visualizing the results as 
the response of the physical system being studied. Since 
a procedure for deducing differential equations is easier 
to implement, too, if equations are left uncombined, 
this method was chosen for the compiler. 

If equations are not to be combined, the process of 
converting a description to Fortran input language can 
be divided into two distinct phases: 1) the determina- 
tion of the proper sequence for evaluating integrator 
inputs and nonintegrator element outputs, and 2) the 
generation of suitable Fortran statements for obtaining 
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the value of these inputs or outputs as a function of the 
outputs of other elements. The former phase, sequenc- 
ing, will now be considered. 

Sequencing 

In order to give a pictorial presentation of the se- 
quencing process, a table giving the proper sequence 
for the evaluation of the elements of an abstract analog 
diagram will be constructed in a fashion similar to that 
which the compiler would employ. 

An analog setup diagram is given in Fig. 4 for a first- 
order differential equation; that is, only one integrator 
appears. Since the sequencing process need distinguish 
only between elements which are integrators and those 
which are not, types of elements other than that of ele- 
ment 7, an integrator, are not specified. Given a de- 
scription of the diagram, the first step in constructing 
a sequence table is to locate an integrator, in this case 
element 7, and enter its number in a sequence table. 
Examining the description of element 7, one of its in- 
puts, element 3, is selected for consideration, the number 
of 3 is entered in the sequence table and its description is 
located. Proceeding to 3’s input 5 and entering 5 in the 
table, the description of the portion of the diagram indi- 
cated by the heavy line in Fig. 5 has been examined, 
and the entries in the sequence table shown in Fig. 5 
have been made. Since 5 has no inputs, element 3 must 
be reselected and its description examined for additional 
inputs which have not yet been selected. Element 3 
has a second input, 1, which has not yet been examined. 
Proceeding in the manner similar to that above, 1, 6, 
and 4 are entered in the table as shown in Fig. 6. After 
4 has been entered in the table, since 4 has no inputs, 1 
is reselected, but both of its inputs have been examined. 
Therefore 3 is reselected, but both of its inputs have 
been examined also. This leads to the reselection of 7 
and the examination of its second input, 8. 

The tracing of the inputs to 8 will disclose an interest- 
ing feature of the sequencing process. This feature is 
that elements with split outputs such as element 1 cause 
duplication of entries in the sequence table. Examina- 
tion of the description of 8 reintroduces 1 for selection. 
Examination of 1 reintroduces 6 and 4 as shown in Fig. 
7. Since 4 has no input, and both inputs of 1 have been 
examined, 8 is reselected. Element 8’s second input is 
2, and 2’s input is 9. Therefore, 2 and 9 are entered in the 
table. The reselection of 2 and the examination of its 
second input discloses that its second input is element 
7, an integrator. 

Inputs to elements which are the outputs of integra- 
tors are treated as a special case. The element number 
of an integrator is never entered in a sequence table 
except as an initial entry. The reason : no expression for 
evaluating the output of an integrator need be written. 
The value of the output of the integrator is supplied 
by the numerical integration. Since 2’s second input is 
7, an integrator, no table entry is made. 



Fig. 4 — An abstract analog setup diagram. 


Sequence Table 



Fig. 5- — Partially examined diagram description. 



Fig. 6 — Partially examined diagram description. 


Sequence Table 



Fig. 7 — Partially examined diagram description 
with reintroduction of 1,6, 4. 


Sequence Table 



Fig. 8— -The completed examination of the diagram description. 

At this point, all inputs to every element of the dia- 
gram have been examined and the sequence table is 
complete. The compiler discovers that the examination 
of the diagram description is completed in the following 
fashion. Since both inputs to 2 have been examined, 8 is 
reselected for examination. Both inputs to 8 have also 
been examined. Consequently 7 is reselected, but both 
inputs to 7 have been examined and 7 is the initial entry 
in the sequence table. Thus, examination of the diagram 
description is completed as shown in Fig. 8. 
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While the examination of the diagram description is 
complete, the sequence table is not yet finished. Re- 
dundant element numbers must be eliminated with the 
lowest occurrence of a number retained as shown in 
Table II, and the table must be inverted and compressed 
as shown in Table III in order to obtain a proper se- 
quence for evaluating the elements. 


TABLE II 

Sequence Table with 
Redundancies Eliminated 


TABLE III 

Final Sequence Table 
(Table II Inverted) 


7 


9 

3 


2 

5 


4 

8 


6 

1 


1 

6 


8 

4 


5 

2 


3 

9 


7 


The final sequence table, Table III, has been ob- 
tained. Comparison of Table III with the diagram of 
Fig. 8 shows that if the elements are evaluated in the 
order given by Table III, the output of each element 9 
through 3 can be calculated and the input to element 
7 obtained. Using a suitable numerical integration pro- 
cedure, the output of 7 can be obtained. Hence, a proper 
sequence for solving the differential equation repre- 
sented by the diagram has been determined. 

The illustration above is for a single differential equa- 
tion. If the diagram involves several differential equa- 
tions, that is, several integrators, a slight elaboration of 
the procedure used above is required. An integrator is 
found and a sequence table is constructed as shown 
above. All elements in the final sequence table are de- 
fined as being complete. Another integratior is found, and 
the procedure above is repeated with the additional pro- 
visions that those inputs which come from complete ele- 
ments give rise to no table entries, and that complete 
elements are not selected for examination of their inputs. 
The final sequence table obtained for the second inte- 
grator is added to the sequence table for the first. The 
process is continued until all integrators have been 
found and a sequence table has been obtained for the 
entire diagram. 

A previous paper 6 proves that the procedure demon- 
strated above does obtain in all cases the proper se- 
quence for diagrams involving equations of form (1) and 
develops a method for, detecting equations not of form 
(1). This method was not demonstrated above, but is 
included in the compiler. 

Generation of Fortran Statements 

The second phase of converting a setup diagram to 
Fortran input language, the generation of suitable 


Fortran statements for evaluating elements, will now 
be considered. 

In contrast to the sequencing phase, the generation 
of Fortran statements is lengthy but logically simple. 
The compiler begins with the first entry in the sequence 
table, and an element number is obtained. The descrip- 
tion of the element is located, and its type is examined. 
On the basis of its type, a set of skeleton Fortran state- 
ments is selected, a set being provided for each type of 
element. The inputs to the element are obtained from 
its description and are inserted in the proper places in 
the skeleton statements. The statements are written on 
magnetic tape. Then the next entry in the sequence 
table is obtained and the process repeated. In Table IV 
we see examples of Fortran statements generated for 
each type of element. 

In addition to generating statements for each element 
as described above, the compiler generates such other 
Fortran statements as are required to obtain a complete 
program. 

The Complete Compiler Program 

The complete compiler program including the se- 
quencing and statement generation portions will now 
be described in terms of the functions which the pro- 
gram performs. These are as follows: 

1) It tabulates the data supplied in the description of 
the analog setup diagram. 

2) The differential equations represented by the dia- 
gram are deduced through determination of the 
proper sequence for evaluating elements. 

3) Fortran statements providing a complete program 
are generated. 

4) As the data of the diagram description is processed, 
it examines the data for errors. 

These functions divide the compiler into four major 
segments — the input routine, the sequencer routine, the 
Fortran statement generator, and the error diagnostic 
routine. 

The Input Routine: The input routine reads the de- 
scription of the elements into the computer memory and 
forms various tables. These are a table of elements 
which have associated parameters, a table of the values 
assigned to these parameters, a table of elements which 
are integrators, and a table of those elements whose 
output is desired for printing. 

The Sequencer Routine: The sequencer routine forms 
a table showing the order in which integrator inputs and 
nonintegrator element outputs must be evaluated. 

The Fortran Statement Generator : The Fortran state- 
ment generator produces a Fortran program the parts 
of which are concerned with the following operations: 

1) Assignment of memory locations for data; 

2) Assignment of values to all parameters associated 
with elements including the initial values of inte- 
grator outputs; 
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TABLE IV 

An Example of Fortran Statements for Each Type of Element 


Element 

Statement Number 

Statement 

reference 

00101 

P101 =XR101 

potentiometer 

00002 

P002 =XK002 * P001 

summer 

00016 

P016= -P001 -P002 -5. * P0051 -10. * P006 

integrator 

00004 

P004= +P003 + 10. * P0U +5. * P0071 

servo multiplier 

00005 

P0051 =P011 * P007 1/100. 

P0052 = P011 * P001/100. 

P0053 = P011 * P003/100. 

electronic multiplier 

00007 

P0071 = -P003 * P005 1/100. 

P0072 = — P003 * P002/100. 

divider 

00008 

P008= -100. * (P0071/P019) 


00095 

IF (P094-XU095) 02095, 01095, 01095 


01095 

P095 = XU095 

GO TO 05095 

limiter 

02095 

IF (P094-XL095) 04095, 04095, 03095 


03095 

P095 = P094 

GO TO 05095 


04095 

P095 = XL095 


05095 

CONTINUE 

Fortran statement 

00015 

P015 = P009 * * 1.5+EXPF (P012-17.6) 

external input 


no statement required 


00034 

P0341 = SQRTF (P029 * *2+P028 * * 2 ) 

IF DIVIDE CHECK 01034, 01034 


01034 

IF QUOTIENT OVERFLOW 02034, 02034 


02034 

TANP2 = P028/P029 

IF DIVIDE CHECK 04034, 03034 

rectangular 


IF QUOTIENT OVERFLOW 04034, 07034 

to polar resolver 


IF (P028) 05034, 20000, 06034 

laial 1 

PN2= -45. 

GO TO 08034 


06034 

PN2 = +45. 

GO TO 08034 


07034 

PN2=ATANF (TANP2) * 28.6478897 


08034 

IF (P029) 09034, 11034, 11034 


09034 

P0342 = PN2 +90. 

GO TO 12034 


11034 

P0342 = PN2 


12034 

CONTINUE 

polar to 

00022 

P0221 =P023 * SINF ( .03490658 * P008) 

rectangular resolver 


P0222 = P023 * COSF (.03490658 * P008) 

P0223 = P025 * SINF (.03490658 * P008) 

P0224 = P025 * COSF (.03490658 * P008) 


00100 

IF (P011) 01100, 02100, 02100 


01100 

P1003 = P098 

P1006 = 0. 

switch 


GO TO 03100 


02100 

P1003 = P099 

P1006 = 0. 


03100 

CONTINUE 


00012 

IF (P003 — V012(l) 30000, 01012, 01012 


01012 

DO 02012 K-l, M012 

function 


IF (P003-V012 (K))03012, 02012, 02012 


02012 

CONTINUE 

generator 


GO TO 30000 


03012 

P012 =W012 (K)-(W012(K)-W012 (K-l))* 

(V012(K) — P003)/(V012(K)— V012(K — 1)) 
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3) Read-in of tables for function generators, of pa- 
rameter changes, and of various constants associ- 
ated with a particular run — the integration inter- 
val, the printout interval, the type of output, the 
cutoff time, and the number of parameter changes; 

4) Numerical integration; 

5) Evaluation of derivatives, that is, the computation 
of the outputs of all elements which are not inte- 
grators and computation of the inputs of inte- 
grators ; 

6) Finally, provision for printing output and testing 
for cutoff time. 

The Error Diagnostic Routine: The detection of errors 
occurs in either the input routine or the sequencer. The 
function of the error diagnostic routine is to print a com- 
plete explanation of the type of error detected, the 
numbers of the elements involved, and other pertinent 
information to enable the user to locate and correct 
errors in the diagram or in its description. The error 
diagnostic routine must also make such changes in the 
diagram description as are required in order to allow the 
compiler to continue to the end of sequencing so that 
several errors may be detected in one compilation 
attempt. 


An Example of the Use of the Compiler 

The solution of a simple problem will now be pre- 
sented as an illustration of the use of the compiler. The 
authors are indebted to Robert Ferner of Convair- 
Astronautics for the preparation of the problem given 
below. 

The problem to be solved involves the testing of a 
possible transfer function for translating an error in 
velocity normal to a reference trajectory of a missile 
into a turning rate to the missile’s autopilot. The auto- 
pilot is approximated by the transfer function, 

10 

G (7) 

S 2 + 45 + 10 


The acceleration of the missile is given by 


A = 



100,000-32.2 


50,000 



500 ( 7 / 


The form of the transfer function to be tested is 


W K i + K 2 S 

— = ( 8 ) 

V e S + K s 

with K i, K 2 , and K s to be determined for stability and 
ability to correct V e , the error in velocity. The rate input 
to the autopilot is limited in magnitude to 0.02. 

A setup diagram fdr this problem is shown in Fig. 9. 
The elements on the diagram have been numbered arbi- 
trarily in preparation for a description of the diagram. 



Fig. 9— Setup diagram for transfer function problem. 


The description of the diagram is given in Fig. 10. 

A list of cards keypunched from the description is 
shown in Fig. 11, and a list of the Fortran program 
generated by the compiler is given in Fig. 12. This pro- 
gram was processed by Fortran, and the resulting ma- 
chine program was run. The results of a run with 
Ki = 0.003, K 2 = 0.0035, and i£ 3 = 0.5 are shown in Fig. 
13. Finally, a graph of the results as plotted by a card 
plotter is given in Fig. 14. 

Experience with the Compiler 

To date, a “breadboard” version of the compiler has 
been in use. It differs from the final version in that the 
error diagnostic routine is not completed. The detection 
of an error in the diagram description ends the compila- 
tion attempt by giving a memory dump. Consequently, 
the “breadboard” compiler is not as convenient to use, 
inasmuch as several passes on the computer may be 
required if several errors are committed in describing 
the diagram. In spite of this shortcoming, the “bread- 
board” compiler has proven to be quite useful. 

Nine problems have been done with the compiler to 
date; consequently, accurate statistics on its use are not 
available. However, orders of magnitude can be given. 
The analog diagrams for the problems ranged in size 
from approximately 20 elements to over 300 elements. 
The time required to prepare and check the diagram 
descriptions varied from one hour to about six hours 
for the largest problem. The computer time required to 
compile the program (including Fortran compilation) 
was from one-tenth of an hour to approximately one 
hour. The programs for problems which had also been 
run on the analog computer used approximately four 
times the amount of computer time required for an 
equivalent but less accurate analog run. These were 
small problems, however. Larger problems would give 
a much more unfavorable ratio. 

As would be expected, the users of the compiler have 
been far more familiar with analog than digital compu- 
tation. Nevertheless, it has been possible for them to 
originate problems for the digital computer and obtain 
solutions as accurate as those provided by hand-tailored 
programs. In general, this lack of familiarity with digital 
techniques has caused little difficulty since the compiler 
causes the digital computer to have many of the proper- 
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AUTOPILOT TRANSFER FUNCTION 



16IN 

100. 

N 

31 



17SU 


N 

19 

1 


8LM 

+ .02 

-.02 N 

17 



13IN 

0 . 

N 

2 



6IN 

0 . 

N 

13 



23DI 


N 

7 

22 


3MS 


N 

6 

23 


1PT 

.0001 

C 

15 



2PT 

10. 

C 

9 



4SU 


c 

5 

16 


5PT 

.5 

c 

20 



7RF 

3.22E6 

c 




91 N 

0 . 

c 

8 

18 

11 

10IN 

0 . 

c 

14 



11PT 

4. 

c 

9 



12RF 

5.E4 

c 




14RF 

500. 

c 




15IN 

0 . 

c 

4 



18SU 


c 

13 



19PT 

.0020 

c 

21 



20SU 


c 

15 



21SU 


c 

4 



22SU 


c 

12 

10 



Fig. 11 — -List of cards punched from description. (The sequence 
has been altered to order problem output.) 





AUTOPILOT TRANSFER FUNCTION 
DIMENSION A(12), X(017), Y(007), Q(007) 

EQUIVALENCE (X(001), T), (X(002), P016), (Y(002), D016), 

1 (X (003), P013), (Y(003), D013), (X(004), P006), (Y(004), D006), 

2 (X(00S), P009), (Y(OOS), D009), (X(006), P010), (Y(006), D010), 

3 (X(007), P015), (Y (007), D015) 

EQUIVALENCE (X(008), XU008), (X(009), XL008), 

1 (X(010), XK001), (X(011), XK002), (X(012), XK005), (X(013), XR007), 

2 (X(014), XK011), (X(015), XR012), (X(016), XR014), (X(017), XK019) 

20000 A(l) == .5 
A(2)=. 29289322 
A(3) = 1.7071068 
A(4) = . 16666667 
A(5) = 1 . 

A(6) = .29289322 
A (7) = 1.707 1068 
A(8) = . 33333333 
A (9) = .5 

A(10) = .29289322 
A(ll) = 1.7071068 
A(12) = .5 
X(001)=0. 

X(002) = 100. 

X(003) = 0. 

X(004) = 0. 

X(005) = 0. 

X(006) = 0. 

X(007) = 0. 

X(008) = + .02 
X (009) = -.02 
X(010) = .0001 
X(0U) = 10. 

X(012) = .5 
X(013) =3 .22E6 
X(014) =4. 

X(015)=5.E4 
X(016) =500. 

X(017)= .0020 

READ 20001, NTO, MP, DT, CT. NC 

20001 FORMAT (11, 18, F16.8, F15.8, 19) 

PRINT 20002 

20002 FORMAT (49H1 AUTOPILOT TRANSFER FUNCTION 

1 55HO TYPE OUTPUT OUTPUT INTERVAL TIME INTERVAL) 
IF (NTO)20003, 20005, 20003 

20003 PRINT 20004, MP, DT 

20004 FORMAT (14H CHECKOUT, 115, F23. 6) 

GO TO 20007 

20005 PRINT 20006, MP, DT 

20006 FORMAT (13H NORMAL, 116, F23 .6) 

20007 PRINT 20008, CT, NC 

20008 FORMAT (33HO CUTOFF TIME NO. CHANGES/1H, F19.6, 110) 
IF(NC)20015, 20015, 20009 

20009 PRINT 200010 

20010 FORMAT (31HO I X(I)) 

DO 20013 J = 1, NC 

20011 READ 20012, I, (X(I)) 

20012 FORMAT (13, F14.8) 

20013 PRINT 20014, I, (X(I)) 

20014 FORMAT (1H, 111, F23.8) 

20015 ASSIGN 20022 TO JUMP1 
ASSIGN 20026 TO JUMP2 
ASSIGN 20016 TO JUMP3 


GO TO 20017 

ASSIGN 20018 TO JUMP1 
ASSIGN 20023 TO JUMP2 
ASSIGN 20031 TO JUMP3 
N = MP 

DO 20023 J = 1, 4 

GO TO JUMP1, (20022, 20018) 

DO 20021 1 = 1,007 

IF (Y (I)) 20020, 20019, 20020 

R = 0. 

GO TO 20021 
S= — DT * Y(I) 

R = A(J)*S-A(J+4)*Q(I) 

X(I)=X(I)+R 

Q(I)=Q(I)+3. *R— A(J+8) *S 

CONTINUE 

Y(l) = — 1 . 

P001 =XK001 * P015 

P002 =XK002 * P009 

P012 = XR012 

P022 = — P012 — P010 

P007 = XR007 

P023 = - 100 . * P007/P022 

P0031 =P023 * P006/100. 

P020= — P015 

P005 = XK005 * P020 

P004 = — P005 — P0 1 6 

D006 = +P013 

P02l = -P004 

P019 = XK019 * P021 

P017= -P019-P001 

IF(P017 — XU008)02008, 01008, 01008 

P008 = XU008 

GO TO 05008 

IF (P017 — XL008)04008, 04008, 03008 

P008 = P017 

GO TO 05008 

P008 = XL008 

CONTINUE 

P011 =XK011 * P009 

P018= — P013 

D009 = +P008+P018+P011 

P014 = XR014 

D010 = +P014 

D013 = +P002 

D015 = +P004 

D016= +P0031 

GO TO JUMP2, (20026, 20023) 

CONTINUE 
N = N — 1 

IF (N) 20025, 20025, 20031 
N = MP 

IF (NTO) 20027, 20029, 20027 

PRINT 20028, T, P016, P017, P008, P013, P006, P023, 
P0031, P001, P002, P004, P005, P007, P009, P010, P011, 
P012, POM, P015, P018, POM, P020, P021, P022 
FORMAT (1HO, F14.6/(1H, 4F14.6)) 

GO TO 20030 

PRINT 20028, T, P016, POM, P008, POM, P006, 

P023, P0031 

GO TO JUMP3, (20016, 20031) 

IF (X(l)-CT) 20017, 20000, 20000 


Fig. 12 — List of Fortran program generated by compiler. 
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TYPE 

OUTPUT 

NORMAL 

CUTOFF 

TIME 

80.000000 

I 

10 

17 

0.TIME 
100. 000000 Ve 


OUTPUT INTERVAL 
40 

NO. CHANGES 
2 

X(I) 

0.00030000 

0.00350000 

-0.350000w -0.020000W 


TIME 

INTERVAL 

0.025000 


0.6 


o.-e 

6439.999878T 

0. —V 


1 .000000 
99.771433 
0.011025 

-0.235164 

6505.050415 

-0.020000 

0.717208 

-0.020674 

1.999999 

98.355980 

0.032057 

-0.161977 

6571.428467 

-0.020000 

2 . 106574 

-0.020226 

2.999998 

95.577519 

0.052006 

-0.113329 

6639.174988 

-0.020000 

3.452772 

-0.019941 

3.999997 

91.439002 

0.071998 

-0.079243 

6708.333008 

-0.020000 

4.829841 

-0.020008 

4.999996 

85.908216 

0.092000 

-0.053577 

6778.946838 

-0.020000 

6.236650 

-0.019999 

5.999995 

78.955898 

0.112000 

-0.032604 

6851.063110 

-0.020000 

7.673180 

-0.020000 

6.999994 

70.551993 

0.131932 

-0.014055 

6924.730286 

-0.014055 

9.135929 

-0.019236 

7.999993 

60.833487 

0.144597 

0.002923 

6999.998901 

0.002923 

10.121800 

-0.004096 


Fig. 13 — A sample of output from the program. 


ties of the analog computer as far as the user is con- 
cerned. However, the nonlinear elements, the switch 
and the limiter, have caused difficulties for the user in 
some cases. Either of these elements can introduce dis- 
continuities in the functions being integrated. The re- 
sponse of an analog integrator to a discontinuity is 
nearly instantaneous; the response of the digital pro- 
gram is not. The numerical integration tends to smooth 
discontinuities, the amount of smoothing depending 
upon the integration interval. Then, too, in contrast to 
the analog computer, switching or the start of limiting 
can only occur at the beginning of or midway in an 
integration interval. Because of these effects, a certain 
amount of experimenting was sometimes necessary in 
choosing an integration interval. 

When the compiler was undertaken, it was antici- 



T1ME 


Fig. 14 — A plot of the velocity error against time. 

pated that it would have three uses — checking analog 
runs, obtaining greater accuracy for selected analog 
runs, or originating problems with no intention of 
making any actual analog runs. With the first two uses 
in mind, the elements which the compiler recognizes 
were made as similar to actual analog elements as possi- 
ble so that existing analog diagrams could be converted 
with little or no change. In practice, the compiler has 
been used principally as a convenient way of originating 
problems for the digital computer. For these problems, 
the close duplication of analog idiosyncrasies has proven 
to be an inconvenience rather than an aid. Therefore, 
consideration is being given to producing another ver- 
sion of the compiler that recognizes elements more con- 
venient for originating digital computer programs. 

Experience has indicated that many problems are 
solved on an analog computer which cannot be trans- 
formed into programs by the compiler because of im- 
plicit relationships between derivatives or between the 
variables. Some of these problems contain implicit re- 
lationships involving nonlinear functions. However, this 
type of problem does not occur frequently enough to 
justify the effort required to develop a new compiler. 
Most of these problems, however, involve implicit rela- 
tionships which can be eliminated by solving a system 
of simultaneous, linear, algebraic equations. Considera- 
tion is being given to the extension of the compiler to 
include this type of problem. 

In conclusion, the authors feel that the compiler will 
be an extensively used program at Convair- Astronautics 
and will fully justify the modest expenditure of effort 
involved in its development. 
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Automatic Design of Logical Networks* 

T. C. BARTEEf 


T HE frequent use of transcendental functions in 
digital computer programs has presented a basic 
problem in the design of real-time control systems. 
Since there are usually no single instructions to perform 
such functions as sine x, arc sine x, etc., two other tech- 
niques have been used. 

The first technique involves storing a table of values 
for the function in rapid access memory. The advantage 
of this technique lies in the speed obtainable; however, 
in order to provide reasonable accuracy, the table must 
be of considerable length. Computations which involve 
physical measurements such as those made by one of 
the newer radars would require a table which would oc- 
cupy a significant portion of most storage devices. An 
alternate technique involves storing a smaller table, 
and then interpolating between the values stored in this 
table. The principal advantage of table storage is then 
lessened, for the interpolation routine requires time and 
the function can no longer be generated as quickly. 

The second technique involves the use of a pro- 
grammed approximation routine. Most of the routines 
used are based on polynomial approximations which re- 
quire several multiplications, and as a consequence, are 
time-consuming. 

The utilization of digital computers in real-time con- 
trol systems which must process large amounts of data 
at high speeds has made the problem more acute. Most 
programs for such systems require values for sines, 
cosines, etc., quite often, and the necessary computa- 
tions must be made quickly and with considerable ac- 
curacy. The need therefore exists for a very fast and ac- 
curate method of generating trigonometric functions. 

Lincoln Laboratory has recently prepared a set of 
flexible computer programs which automatically per- 
form the design of logical networks which will perform 
such functions in a single step. These networks may 
then be connected into the arithmetic element of an in- 
ternal-binary-operation digital computer, providing the 
machine with instructions which will yield the sine, arc 
sine, etc., of an angle stored in one of the registers of the 
arithmetic element. The design technique yields a logi- 
cal network with a number of input and output lines, 
the input lines representing the independent variable 
and the output lines the dependent variable. The net- 
works are completely digital in operation, utilizing high- 
speed transistor and diode logical circuitry. 

Because of the magnitude of the design problem for 
large logical networks, conventional manual design tech- 

* The work reported here was performed at Lincoln Lab., Mass. 
Inst. Tech., Lexington, Mass., with the joint support of the Army, 
Navy, and Air Force. 

f Lincoln Lab., M.I.T., Lexington, Mass. 


niques could not be used. For instance, the complete set 
of expressions needed to describe a 12-input bit and 14- 
output bit sine network in developed normal form con- 
tains approximately 90,000 symbols. Normal design 
techniques involving manual calculations would pro- 
hibit either the generation or simplification of a set of ex- 
pressions of this complexity. It was necessary, therefore, 
to design and prepare a set of digital computer programs 
which would automatically generate the desired set of 
expressions, simplify them, and then check the simpli- 
fied equations. The series of programs written are gen- 
eral purpose in design and may be used to generate and 
simplify the logical equations describing any function 
with a unique value of the dependent variable for each 
value of the independent variable. 

Only three basic logical relationships or operators are 
used in the equations. These are defined as logical addi- 
tion, logical multiplication, and complementation. Fig. 
1 illustrates a standard diode circuit for the logical addi- 
tion relationship. The inputs to this diode “or” circuit 
consist of dc levels of —5 and +5 volts corresponding to 
the logical symbols 0 and 1, respectively. To the left of 
the figure is the block diagram symbol for the “or” cir- 
cuit. The plus sign in the block is also used to indicate 
logical addition in the written expressions. This is, of 
course, the “inclusive or” function, sometimes referred 
to as “logical disjunction.” 

Fig. 2 illustrates a diode “and” circuit which performs 
the logical multiplication function, sometimes referred 
to as “logical conjunction.” The block diagram symbol 
for the “and” operation is to the left of this figure. 

Fig. 3 illustrates a two-level diode “and-to-or” circuit. 
There are six inputs to this circuit, each consisting of dc 
levels. The expression describing this circuit in Boolean 
algebra notation is written at the far right of the figure. 
This expression may be read a and b and c, or d and e 
and / or the product of a , b and c, plus the product of 
d, e and /. According to the notation used, the output 
will be equal to 1 only when a and b and c or d and e and 
/ are equal to 1 . 

The third logical operation used is the complement 
function, sometimes referred to as inversion or negation. 
A transistor inverter designed for the Lincoln TX-2 
computer is illustrated in Fig. 4. A prime sign (') is used 
to indicate the “ complement” or “not” function in the 
expressions. When flip-flops are used to provide the in- 
puts to the networks, complemented as well as the un- 
complemented values of the input variables are avail- 
able. 

At a given time each input line to a logical network 
will contain a signal representative of either a binary 0 
or 1. The fundamental characteristic of the logical net- 



104 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


-30 VOLTS 





INPUT 

OUTPUT 

x y 

z 

0 0 

0 

0 \ 

i 

\ 0 

< 

\ \ 



INPUT 
x y 

OUTPUT 

z 

-5V-5V 

-5V+5V 

+5V-5V 

+5V+5V 

- 5V 

+ 5V 
+ 5V 
+ 5V 


Fig. 1 — The “or” circuit. 
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Fig. 2 — Diode “and” circuit. 
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Fig. 4 — Transistor inverter. 


work is that for each combination of input values there 
will be a unique output value, represented by signals on 
the output lines. Since the output from a logical network 
of this sort is determined by the inputs to the logical 
network, it is possible to construct a logical network that 
will perform any function having the characteristic that 
for each state which the inputs assume, there will be a 
unique output. It may be seen that this does not limit 
the function performed to transcendental or straight- 
forward mathematical relationships, but allows any in- 
put-to-output relationship. 

When a logical network contains several input and 
output lines, it is possible to simplify the design problem 
by considering only one output line at a time. The sig- 
nal on this particular line is determined by the values 
of the inputs to the logical network at that time, and 
may be specified by means of a Boolean function of the 
input variables. This expression will be equal to 1 when 
the output from this particular line of the network is 1 
and to 0 when the output is 0. An expression that repre- 
sents the operation of a single line of a logical network is 
called the transmission function for that line of the net- 
work. A fully developed transmission function, when 
written in the sum-of-products form, consists of a num- 


ber of terms, each of which is a product of all of the input 
variables, certain of which may be complemented. This 
type of expression is referred to as a canonical expansion 
for the circuit transmission. 

The first step in the design procedure for a network 
with several input and output lines consists of deriving 
the canonical expansion for each output line of the net- 
work. Table I shows the derivation of the canonical ex- 
pansion for the transmission function for one output 
line bit of a small network which will yield the sine of x 
within the limits 0° to 90°. The leftmost column of the 
table lists the angle x in degrees, starting with 0° and 
increasing by increments of 6° to 90°. The next column 
contains the same set of angles coded as binary num- 
bers, starting with 0000 and increasing to 1111 by steps 
of 0001, which increases the angle by 6° in each case. 
The values of sine x, expressed in binary form, are listed 
in the next column of the table. To the right of the col- 
umn representing the sine values is a column listing the 
Boolean symbols for the input variables in product- 
term form. The variables are primed or unprimed de- 
pending on whether or not the respective input value is 
0 or 1. The computer program generates the sine value 
for each input angle and then, examining one bit of the 
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TABLE I 

Canonical Expansion for Sin * from 0° to 90° 


X 

(degrees) 

(binary form) 
abed 

Sin x 

(binary form) 

(1) (2) (3) (4) 

Product 

Terms 

0 

0 

0 

0 

0 

0 

0 

0 

0 

a‘ 

' b ' 

c' 

d' 

12 

0 

0 

1 

0 

0 

0 

l 

1 

a 1 2 

’ b' 

c 

d' 

18 
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0 

1 

1 

0 

1 

0 

0 

a ‘ 

' b' 

c 

d 

24 

0 

1 

0 

0 

0 

1 

1 

0 

a‘ 

' b 

c' 

d' 

30 

0 

1 

0 

1 

1 

6 

0 

0 

a 

' b 

c' 

d 

90 

1 

1 

1 

1 

1 

i 

1 

1 

a 

b 

c 

d 










l 


T(2) =a'b'cd-\-a , bc'd' +ab'c 'd -\-ab 'cd ' +ab 'cd -\-abc 'd ' -\-abc 'd 
'-\-abcd' -{-abed 


The first step of the minimization procedure consists 
of matching each term of the canonical expansion with 
each of the other terms, and if the terms differ in only- 
one variable, eliminating that variable. This matching 
procedure actually consists of the repeated, application 
of the theorem (<f>a +0a' = 4>). Table II illustrates the 
first step of this matching procedure, using the product 
terms which comprise the expression developed in Table 
I. It is important that a given term be matched with all 
of the other terms and not dropped after the first 
match. For instance, abe'd' in Table II must be matched 
with both abe'd and abed'. After the first set of matches, 
the resulting terms will each have one variable less than 
the original set of terms. Since, in Table II, the first set 
of terms consists of four variables each, the next col- 


TABLE II 


Derivation of Prime Implicants 
Theorem (<j>a +<£«' =<t>) 


First Cycle 

Second Cycle 

Third Cycle 

First Cycle 

Second Cycle 

Third Cycle 

a' b' 

c 

d 

— 

b' 

c 

d 
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0 

1 

1 

— 

0 

1 

1 


a' b 

c' 

d' 

- 

b 

c' 

d' 


0 

1 

0 

0 

- 

1 

0 

0 


a b' 

c' 

d 

a 

b' 

- 

d 

a — — d 

1 

0 

0 

1 

1 

0 

- 

1 

1 - - 1 

a b' 

c 

d' 

a 

- 

c' 

d 

a - c - 

1 

0 

1 

0 

1 

- 

0 

1 

1 - 1 - 

a b' 

c 

d 

a 

b' 

c 

- 

a b - - 

1 

0 

1 

1 

1 

0 

1 

- 

1 1 - - 

a b 

c’ 

d' 

a 

- 

c 

d' 


1 

1 

0 

0 
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c' 

d 

a 

- 

c 
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1 

1 

0 

1 

1 

- 

1 
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c 

d' 
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b 

c' 
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1 

1 

1 

0 

1 

1 

0 
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a 

b 
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1 

1 

1 
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1 

- 
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1 

1 

- 
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1 

1 

1 
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sine values at a time, develops and stores the canonical 
expansion for each output line of the logical network. 
In effect, the computer proceeds down a column of out- 
put values until it finds a 1. It then stores the input 
values which generated the 1 output in the output table. 
Table I illustrates the derivation of the product terms 
for the second bit of the four output bits. Each 1 in the 
second-output-bit column is underlined, as is the prod- 
uct term for this particular output. The complete canon- 
ical expansion for the output line representing the sec- 
ond least significant output bit of this particular table is 
illustrated at the bottom of the figure. An expression is 
therefore generated for each output line of the logical 
network. 

The minimization technique which has been pro- 
grammed is based on work done by Quine 12 of Harvard, 
and by McClusky 3 during the preparation of his doc- 
toral dissertation at M.I.T. The input to this particular 
program is the canonical expansion developed by the 
preceding program, and the output is a minimized 
equivalent sum-of-products expression. 

1 W. V. Quine, “The problem of simplifying truth functions,” 
Amer. Math. Monthly, vol. 59, pp. 521-531; 1952. 

2 W. V. Quine, “A way to simplify truth functions,” Amer. Math. 
Monthly, vol. 62, pp. 627-631; 1955. 

3 E. J. McCluskey, Jr., “Minimization of Boolean functions,” 
Bell Sys. Tech. J., vol. 35, pp. 1417-1444; 1956. 


umn, containing the shortened terms, will consist of 
terms of three variables. In Table II the missing vari- 
ables are indicated by dashes. If any term of the expres- 
sion does not match with any other term, it is then a 
“prime implicant” term and will be a term of the final 
prime implicant expression. 

After the original terms have all been compared, a 
second set of terms, each one variable shorter than the 
original terms, will have been derived. These terms are 
then matched with each other, using the same theorem 
(kaf-^a' = (j). If the remaining variables are maintained 
in their original positions in the terms, and the elim- 
inated variables indicated by means of a dash, as in 
Table II, the matching process may be made somewhat 
simpler to perform. In this case the terms are matched 
on the following basis: first, the dashes indicating the 
missing variable or variables must be in the same posi- 
tion (that is, ab-d' and ab-d may be matched but there 
is no possibility of matching a-cd with ab-d) and second, 
the remaining variables must all be identical save one 
(that is, ab-d' can be matched with ab-d, yielding ab — ). 
This process is continued until no further matches can 
be made. 

Third and further cycles of this process are continued, 
using the same rules, until a single pass through a cycle 
yields no matches. The remaining terms plus all the 
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terms that did not match during the process comprise 
the prime implicants. 

The right half of Table II illustrates how the above 
matching process is performed in the computer. The 
terms of the original expression are stored in binary 
form, using positional notation to maintain the identity 
of the variables. A 1 is used to indicate an unprimed 
variable and a 0 to indicate a primed variable, so that 
db'cd is expressed as 1011. Since variables will be elim- 
inated in this process, it is convenient to utilize another 
register of computer storage as a mask, and to alter the 
mask as variables are eliminated, while maintaining the 
variables in order. Each term is therefore represented by 
two registers, one containing the “values” of the vari- 
ables and another the mask for the term. The mask is 
altered to indicate the eliminated variables. For in- 
stance, if two terms 1011 and 1010 and their masks are 
matched, the resulting term is 101- or ah' c. The first 
step in the matching process in the computer, therefore, 
is to compare the masks to see if they are identical. If 
the masks are identical, the terms are then matched 
and if they differ in only one variable, a new term is 
formed along with a new mask which indicates the miss- 
ing variable or variables. 

In order to shorten the matching problem, McCluskey 
has shown that the terms may first be sorted according 
to the number of l’s in each term. Table III illustrates 
the same set of terms after they have been sorted and 
arranged in tabular form. Each section of a column of 
the table contains terms with one more 1 than the terms 
in the preceding section of the table. It is necessary to 
match only the terms from one section of the table with 
the terms in the preceding and following sections of the 
table, for two terms which differ by more than one 1 
cannot match: 1011 cannot match with any term con- 
taining only one 1, for instance 1000, for the two terms 
must, of necessity, vary in more than one variable. 
Further, if the terms from one section of the table are 
matched with those of the next section, the resulting 
shortened terms can be matched only with shortened 
terms formed by matches in the preceding and following 
sections of the table. This technique significantly re- 
duces the number of matches which must be made and 
also materially lessens the amount of fast-access mem- 
ory that is required at a given time. The number of 
terms formed by the matching process tends to increase 
considerably for large problems before finally decreas- 
ing. By storing only the sections of the table which are 
being matched in high-speed memory, and storing the 
rest of the terms on tape or drums, large problems may 
be handled more easily. 

The set of terms derived in this manner, if collected 
in sum-of-products form, will form an expression 
equivalent to the original expression. An important 
characteristic of these terms is that none of them can be 
shortened by omitting a variable. Quine has shown that 
the shortest sum-of-products expression must consist of 
a subset of these terms. It may be shown that certain 


TABLE III 

Ordered Derivation of Prime Implicants 


First Cycle 

Second Cycle 

Third Cycle 

0 1 

0 

0 
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0 

0 

1 - - 1 


0 0 
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1 

1 

1 - 1 - 


1 0 

0 

1 

1 
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1 1 - - 


1 0 

1 

0 

1 

- 

0 

1 



1 1 

0 

0 

1 

0 

1 
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1 0 

1 

1 

1 

— 
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0 



1 1 

0 

1 

1 

1 

0 

- 



1 1 

1 

0 
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- 

0 



1 1 
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1 
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1 
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1 
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1 

1 

1 

- 




Prime Implicants = (- 1 0 0 + - 0 1 1 + 1 1 + 1 - 1 - + 1 1 ) 

or (bc'd 1 -\-b'cd+ad-\-ac-\-ab) 


+30 -30 



of these terms are common to every possible minimal 
subset, while the remaining terms of each subset may 
be chosen, subject to a set of constraints. The computer 
program has been designed to print automatically those 
prime implicant terms common to every minimum ex- 
pression. The computer then prints a table of the re- 
maining prime implicant terms along with the necessary 
constraint information. Choice of the remaining terms 
in the final expression is made from the table. This final 
choice has been left separate from the computer pro- 
grams to permit flexibility in the design of the circuitry. 

Fig. 5 illustrates a two-level diode circuit which per- 
forms the function developed in Table I. Fourteen 
diodes are required to construct this particular network. 
Since the original canonical expansion would have re- 
quired 45 diodes, 31 diodes were saved by the minimiza- 
tion process. 

The entire process was programmed for the Whirlwind 
computer located at the Massachusetts Institute of 
Technology. This is a general-purpose, stored-program 
computer with a word length of 16 bits. The entire set 
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of programs are slightly over 3000 orders in length. The 
program requires about 40 seconds to generate the 
canonical expansion for each output line of a 12 input- 
line by 14 output-line problem and from eight to ten 
minutes to minimize and print the final expression. 
About 25,000 registers are required to store partial re- 
sults during the processing. As a result, drum storage is 
used during the minimization procedure. The procedure 
is now being programmed for an IBM 709 located at 
the laboratory, making possible the solution of larger 
problems and somewhat shortening the programs’ run- 
ning time due to the large core storage of the 709. 

The design procedure described here appears very 
flexible. It can be used to perform automatically the 
logical design of circuitry which will perform any func- 
tion which has a unique value of the dependent variable 
for each value of the independent variable. 


To date, networks which yield sine, arc sine, and 
the square root of the input value have been con- 
structed. The concept of programmed logic as an aid 
to computer design appears quite attractive for the 
design of future machines. 
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The Role of Digital Computers in the Dynamic 
Optimization of Chemical Reactions 

R. E. KALMANf and R. W. KOEPCKE £ 


I. Introduction 

ALONG with the increasing availability of high- 
f—\ speed, large-storage digital computers, there has 
been growing interest in their utilization for real- 
time control purposes. A typical problem in this connec- 
tion and one of long-standing interest is the optimal 
static and dynamic operation of chemical reactors. 1 - 2 To 
our knowledge, no digital computer is being used for 
this purpose, chiefly because of the many difficulties en- 
countered in utilizing real-time machine computation in 
reactor control. These difficulties range from the un- 
availability or inadequacy of hardware (i.e., transducers, 
measuring instruments, low-level analog-to-digital con- 
verters, etc.) to the lack of a well-established body of 
fundamental theoretical principles. Although a great 
deal is known about the basic concepts governing con- 
trol systems, 3 - 4 present methods cannot be readily ap- 
plied to designing a program for a real-time digital con- 


f Res. Inst, for Advanced Study, Baltimore 12, Md. 

j IBM Res. Center, Yorktown Heights, N. Y. 

1 T. J. Williams, “Chemical kinetics and the dynamics of chemical 
reactors,” Control Engrg., pp. 100-108; July, 1958. 

2 R. Aris and N. R. Amundson, “An analysis of chemical reactor 
stability and control,” Chem. Engrg. Sci., vol. 7, pp. 121-155; 1958. 

3 J. G. Truxal, “Automatic Feedback Control System Synthesis,” 
McGraw-Hill Book Co., Inc., New York, N. Y. ; 1955. 

4 J. R. Ragazzini and G. Franklin, “Sampled-Data Systems,” 
McGraw-Hill Book Co., Inc., New York, N. Y. ; 1958. 


trol computer. This is because the existing design meth- 
ods are applicable primarily to fairly small-scale sys- 
tems, whereas the use of a digital computer (in fact the 
very attractiveness of computer control) arises pri- 
marily in connection with large-scale problems. 

The role of the digital computer in real-time control 
consists essentially of “digesting” large amounts of in- 
formation obtained from the primary measuring instru- 
ments and then calculating, as rapidly as possible, the 
control action to be taken on the basis of these measure- 
ments. 

One purpose of this report is to provide a broad out- 
line of a new approach to designing control systems for 
chemical processes which are to be built around a fast, 
general-purpose digital computer operating in real time. 
The specific engineering details of the computer will not 
be of any interest here; rather, we have concentrated on 
studying the types of computations the computer is to 
perform. To lend concreteness to the discussion, the 
chemical process under consideration will be a con- 
tinuous-flow, stirred reactor. After the fundamental con- 
cepts have been established, the detailed analytic equa- 
tions (in the linear case) leading to the dynamically 
optimal (and thus also statically optimal) design of the 
reaction control system are given in Section III. The 
equations of Section III represent a special case of the 
new design theory of linear control systems formulated 
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by the authors. 6 6 The performance of the dynamically 
optimized control system is illustrated with the aid of 
a numerical example. 

In Section IV the limitations of the linearity assump- 
tion or, rather, the additional steps necessary to attack 
realistic practical problems, are briefly discussed. It is 
impossible to give more than a rough sketch of these 
new methods in a short report; however, specific details, 
mathematical proofs, and discussion of engineering 
problems may be found in the literature. 5-12 

One of the mathematical tools used in the new ap- 
proach has been called dynamic programming by its de- 
veloper, Bellman. 13 This is a new method for the solu- 
tion of problems in the calculus of variations where 
dynamic constraints play the central role. It turns out 
that our new approach to the description of control- 
system dynamics leads to concepts which are also the 
“natural setting” for solving the optimization problem 
by dynamic programming. A second purpose of this re- 
port is to provide a better appreciation of the advan- 
tages as well as the limitations of dynamic programming, 
thereby promoting its use in the solution of engineering 
problems. 

Perhaps the most outstanding advantage of the use of 
dynamic programming in our problem is that it reveals 
the intimate connection between the static and the dy- 
namic optimization of the process. In other words, the 
problem of selecting the operating conditions of the 
process to obtain optimum yield or optimum product 
quality cannot be realistically divorced from the prob- 
lem of providing effective regulation to maintain the 
process at these conditions. Although these matters are 
well known to workers skilled in the control art, they 
are often not clearly understood by others. 

In addition to providing some practical means for the 
solution of reactor control problems, it is hoped that this 
report will help clarify a number of basic questions. 


B R. E. Kalman and R. W. Koepcke, “Optimal synthesis of linear 
sampling control systems using generalized performance indexes,” 
Trans, ASME, vol. 80, pp. 1820-1826; 1958. 

6 R. E. Kalman and R. W. Koepcke, “Dynamic optimization of 
linear control systems. I. Theory. II. Practical aspects and exam- 
ples.” (Scheduled for publication in the IBM J. Res. Dev., vol. 4; 
1960.) 

7 R. E. Kalman and J. E. Bertram, “General synthesis procedure 
for computer control of single and multiloop linear systems,” Trans. 
AIEE, vol. 77, pt. 2, pp. 602-609; 1958. 

8 R. E. Kalman, “Optimal nonlinear compensation of saturating 
systems by intermittent action,” 1957 IRE WESCON Convention 
Record, pt. 4, pp. 130-135* 

9 R. E. Kalman and J. E. Bertram, “A unified approach to the 
theory of sampling systems,” J. Franklin Inst., vol. 267, pp. 405- 
436; 1959. 

10 R. E. Kalman, L. Lapidus, and E. Shapiro, “On the optimal 
control of dynamic chemical and petroleum processes.” (Scheduled 
for publication in Chem. Engrg. Progress .) 

11 P. E. Sarachik, “Cross-coupled multi-dimensional feedback 
control systems,” Ph.D. dissertation, Dept, of Elect. Engrg.-, Co- 
lumbia University, New York, N. Y. ; 1958.- 

12 R. E. Kalman, “On the general theory of control systems,” Proc. 
Internatl. Congr. on Automatic Control, Moscow, U.S.S.R., Academic 
Press, New York, N. Y.; 1960. 

13 R. E. Bellman, “Dynamic Programming,” Princeton University, 
Press, Princeton, N. J.; 1957. 


II. Fundamental Concepts 
A. Description of Chemical Reactor 

The continuous-flow, stirred-tank type of chemical re- 
actor with which we shall be concerned here is shown in 
Fig. 1 . The principal inputs to the reactor consist of 
liquid streams carrying the various raw materials. The 
volume-flow rates of the input streams in Fig. 1 are de- 
noted by Mi, M2, Mi. Each stream carries one or more 
compounds, whose concentrations (measured in terms of 
moles/unit volume in Fig. 1 ) are denoted by f/ 1( • • • , 
U5. Other inputs to the reactor may include a catalyst 
stream (with flow rate Mi in Fig. 1 ) and provisions for 
cooling or heating (with heat- flow rate in Fig. 1 ). 
The numbers X\, • • • , X n denote the concentrations of 
the various compounds inside the reactor (some of which 
come from the input streams and some of which are 
formed chemically inside the reactor) ; one of the Xi will 
denote the temperature of the material inside the re- 
actor. Due to agitation, the concentrations of the vari- 
ous compounds as well as the temperature are assumed 
to be approximately the same at every point inside the 
reactor and in the output stream. In most cases, it is 
desirable to keep the amount of material in the reactor 
constant. This is achieved by means of a level controller 
which keeps the output stream ( Fo in Fig. 1 ) at all times 
approximately equal to Mif- • ■ ■ +M4. 

Agitation 



The object of the reactor is to produce a certain con- 
centration of chemicals in the output streams. To ac- 
complish this with the given types and concentrations 
of raw materials in the input streams, one can vary the 
flow rates Mi, • • • , M$. Since reactions take place more 
rapidly as the temperature increases, control can be 
exerted by changing the temperature in the reactor 
which, in turn, is achieved (subject to the dynamic lags 
of heat transfer to the reactor) by changing the heat- 
input flow-rate M5. The amount of catalyst present in 
the reactor also affects the reactions; the amount is con- 
controlled by changing Mi (subject to a time constant = 
reactor volume / Fo if the amount of catalyst is not af- 
fected by the reaction). Similarly, some measure of con- 
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trol can be exerted by changing the flow rates Mi, M 2 , 
Mz', the effect of these changes is complicated and 
depends on the reaction dynamics. 

B. Statement of the Control Problem 

The principal objectives in designing a reactor control 
system may be stated as follows: 

Problem: Given the desired values Xf, • • • , X n d of the 
concentrations in the output stream at time to, manipulate 
the control variables in such a manner as to bring rapidly 
the actual concentrations existing in the reactor at time to 
as close as possible to the desired concentrations and then 
keep the actual concentrations constant at all times despite 
changes in the concentrations of the input streams, ambient 
temperature, etc. If, at time t\>to, the desired values of the 
concentrations are changed, the above process is repeated. 

We now examine this problem in more detail. In doing 
so, we shall specify precisely what is to be meant by “as 
close as possible” and “rapidly.” 

C. Reaction Dynamics 

Let us assume that p molecules of compound A and q 
molecules of compound B combine chemically to form a 
new compound C. If the concentrations X A , X B , X c , of 
the various compounds are small, the rate of increase of 
the concentration of compound C is given by the well- 
known Arrhenius equation. 1 ’ 2 

dXc/dt = k AB {T)X A *X B «. (1) 

In (1), the reaction rate coefficient is given by 

k AB (T) = a AB exp ( — E ab /RT), (2) 

where <xab is a constant, E ab the activation energy of the 
reaction, T the absolute temperature, and R the gas 
constant. Moreover, the rate of decrease of the concen- 
tration of compounds A and B resulting from the reac- 
tion is equal to p resp. q times the right-hand side of (1). 

In qualitative physical terms, the Arrhenius equation 
has the following interpretation. Consider a small vol- 
ume with diameter equal to the effective range of inter- 
molecular forces. If p molecules of A and q molecules of 
B have entered this small volume, a reaction takes place, 
but not otherwise. In a dilute solution, the probability 
of a molecule of some compound entering the small 
volume as a result of thermal agitation is proportional 
to the thermodynamic factor exp {—Eab/RT) and the 
concentration of the compound, but independent of the 
concentration of the other compounds. The probabilities 
of independent events multiply, hence (1). 

In general, the assumptions which lead to the par- 
ticular form of (1) are not true, but the reaction rate is 
still a function of the temperature and concentrations. 
Thus, in general, one would replace (1) by 

t dXc/dt = k AB (X A , X B , Xc, T ), (3) 

where k AB is some scalar function of the four variables 
indicated. 


It follows that the reaction shown in Fig. 1 can be 
described by the set of differential equations 

dXi/dt = fi(X h • • • , I n ; M !,•••, Mi) U h • • • , U k ) (4) 
(i = 1 , • • • , n; k, l, n = integers). 

This is a good place, conceptually and in order to 
simplify the symbolism, to introduce vector-matrix 
notation. Thus, let X be a vector (»X 1 matrix) with 
components X\, • • • , X n . Similarly, M and U are de- 
fined as a (/XI) and (&X1) matrix, respectively; / is a 
vector function of k-pl-fn arguments with components 
fi, • * ' ,/*. 

In terms of the new notation, (4) becomes 

dX/dt = f(X, M, U). (5) 

The vector X is called the state of the reactor and the 
components of X are known as the state variables. The 
reason for this terminology is that if the reactor inputs 
M{t) and U(t) are specified for all time t>to, then the 
knowledge of X{tf) supplies the initial conditions from 
which the solutions of the differential equation (5) can 
be uniquely determined (subject to some mild mathe- 
matical restrictions) for all future values of time. 
Thus, the state is a fundamental mathematical concept 
for describing the reactor dynamics ; it is also a physical 
concept. The temperature and various concentrations 
can be physically measured (at least in principle) ; thus 
the state at time to may be regarded as the information 
necessary to determine the properties of the material 
inside the reactor at time to. 

The behavior of the reactor through time may be 
visualized as a succession of changes in state. This gives 
rise to the concept of the state-transition function. In 
fact, the function / in the differential equation (5) may 
be regarded as specifying the incremental state transi- 
tions taking place during the interval {t, t-fdt). For 
present purposes, it is more convenient to deal with 
finite-interval state transitions which are obtained by 
solving the differential equations. Anticipating the later 
discussion, let us note that for control purposes it is 
sufficient to sample the state of the process; i.e., observe 
the state only at discrete instants in time, called sam- 
pling instants. Usually, the sampling instants are sepa- 
rated by equal intervals t of time (r is called the sam- 
pling period), i.e., the sampling instants occur at times 

to, to + T, to + 2r, • • • . 

Now suppose that r is chosen to be so small that in the 
interval {to, /o+t) the functions M(t), U{t ) in (5) may 
be adequately approximated by the constants M{to), 
U{to ). Then (5) can be readily integrated (if necessary, 
by numerical methods) and we get 

X{to + r) = </>(r; X(t 0 ), M(t 0 ), U(t 0 )), (6) 

where 0 is a vector function with n components and 
k-pl-fn arguments. 
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D. Static Optimization 


Precisely what is meant by . the phrase, “as close as 
possible to the desired concentrations” in the statement 
of the basic problem in Section II-B? 

The states of the reactor may be represented as 
points in w-dimensional Euclidean space (the state 
variables being coordinates of the point) called the state 
space. Suppose we specify r components of the state vec- 
tor as desired values, with the remaining n — r com- 
ponents being arbitrary. This step may be regarded as 
essentially a management decision, relating to the ques- 
tion of how one should try to operate the reaction proc- 
ess. The set of states for which the operation of the reac- 
tor meets the management requirements is clearly an 
{n — r ) -dimensional hyperplane. If the state of the re- 
actor at any instant of time does not lie in the hyper- 
plane, we can measure the “badness” of that state by 
the distance of the state from the hyperplane of desired 
states (see Fig. 2). The definition of the distance func- 
tion (technically, a pseudo-metric) is arbitrary and de- 
pends on a management estimate as to what types of 
deviations from the desired values are more harmful 
than others. One possible definition of the distance func- 
tion is 


P (X“ - X) 


- r —I 1/2 

E W - x,y 


(r < n). (7) 


More generally, if Q is any positive semidefinite matrix, 
we can define p by the quadratic form 

p{X d - X) = (X d - X) 'Q{X d - X), (8) 

where the prime denotes the transpose of the matrix. 

By static optimization of the process we mean select- 
ing a set of constant values M° of the control variables 
(subject to some magnitude constraints), so that at 
equilibrium the actual state lies as close as possible to 
the hyperplane of desired states. By definition, the 
equilibrium states X* of the reactor are given by: 

dX/dt = /(X*, M, U ) = 0, (9) 

M, U being constant vectors. Thus the statically op- 
timal control vector M° and equilibrium state X*° are 
determined by solving the minimization problem 

Min p(X d - X*), 0 < Mi < (10) 

M 


To find the optimal control vector M° from (10), X* has 
to be expressed as an explicit function of M from (9). 
This and the amplitude constraints on the control vari- 
ables lead to great analytic difficulties when / is a non- 
linear function. But even in cases where the static op- 
timization problem can be solved, it does not provide a 
complete answer to the basic problem. This is because: 

1) Static optimization does not provide a guide as to 
how the control variables should be manipulated to 
bring an arbitrary state as close as possible (in terms of 
the arbitrarily adopted distance function) to the desired 
state (dynamic optimization) . 


E r 


x d 


•i 

l /.(x-x") 

j Hyperp lane of 

Desired States 


0 


E 


n-r 


Fig. 2. 


2) The equilibrium state closest to the hyperplane of 
desired states may not be stable. 

3) The values of the control variables computed by 
static optimization will not remain optimal when some 
of the process parameters (concentrations in the input 
flows, ambient temperature, etc.) change. In other 
words, static optimization does not incorporate the im- 
portant principle of feedback. 

In the following it is shown that it is possible to com- 
bine both dynamic and static optimization in such a way 
that the principle of feedback is retained. 

E. Dynamic Optimization 

In our basic problem statement in Section II-B, the 
last remaining word to be defined precisely is “rapidly.” 

A performance index for the reaction under dynamic 
conditions may be defined as 

/ > 00 

p [X d - X(0 ] exp [ a (t - to) ]dt, (11) 

to 

where a: is a real constant. We now agree that the phrase, 
“. . . to bring rapidly the actual concentration as close 
as possible to the desired concentrations ...” in the 
problem statement means that the control variables 
M(t) are to be chosen, as functions of time, in such a 
way as to minimize the performance index (11) for any 
initial state X(to). This is called dynamic optimization. 
Of course, the definition of (P, in particular the value of a 
in (11), is arbitrary and depends on management esti- 
mates just as the definitions of X d and p. 

Static optimization is evidently a special case of 
dynamic optimization, as may be seen by setting 
X(to) =X*° in (11). In fact, it may happen that the dy- 
namic optimization leads to the result that, instead of 
trying to maintain the control variables at constant, 
(equilibrium) values, it is better to vary the control 
variables continuously, say, in a periodic fashion. In 
such a case, dynamic optimization will lead to a smaller 
value of (P(X*°) than static optimization. 

In order to perform dynamic optimization, we must 
find a particular vector function M°(t), defined for all 
t>to, among the set of all such functions (subject to 
amplitude constraints) for which the integral (11) 
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assumes its minimum or least upper bound. This is 
generally a very difficult problem in the calculus of vari- 
ations and, for all practical purposes, cannot be solved 
by conventional analytic methods when the number of 
state variables is large. 

So as not to be bothered by certain mathematical 
niceties, we shall assume from here on that the control 
variables have constant values over the sampling inter- 
vals r (cf. Section II-C). 

Thus instead of minimizing (P with respect to all pos- 
sible functions M(t), the minimization is to be performed 
with respect to all possible sequences of constant vectors 

M(t 0 ), M(t 0 + r), M(to +2 r), • • • (0 < Mft) < Mf ). (12) 

Moreover, again for simplicity, the integral in (11) may 
be replaced by a sum : 

co 

(P[X(t 0 )] = T,p[X d ~ X(t 0 + *r)]X*, (13) 

k~l 

where X = exp ar. 

From (6) we see that there is a large number of pos- 
sible state transformations X(t 0 ) — >X(/ 0 +r), depending 
on the choice of M(to) (assuming that U = const). 
Similarly, the state transformation X (to~\-r) —>X (to~\-2r) 
depends on the choice of M(to~\-r) (see Fig. 3). Thus the 
minimization of (P may be regarded as an infinite-step 
decision procedure. The optimal choice of M(to-\-kr) at 
the kth step in general depends both on the preceding 
and succeeding steps. Therefore, at first sight, it would 
appear that to obtain the optimal sequence of control 
vectors, 

M°(to), M°(t 0 + r), M°(t o + 2t), • • . (14) 

we must simultaneously minimize (13) with respect to 
all the terms of the sequence (12), which is an impossible 
job. 

Fortunately, at this point we can achieve a decisive 
simplification by making use of the following intuitively 
obvious, but powerful, observation due to Bellman. 13 

Principle of Optimality : An optimal sequence of control 
variables (14) has the property that , whatever the initial 
state X (t 0 ) and the initial choice M°(t 0 ) of control vector are , 
the remaining terms M 0 (toA~r ) , M (i (to J c2r) , • • • of (14) 
must constitute an optimal sequence with regard to the state 
X(to-\-r) resulting from the choice of M°(t 0 ). 

Using the principle of optimality, we can obtain vari- 
ous expressions for the theoretical study and practical 
determination of the optimal control sequence (14). 
(Methods derived from the principle of optimality are 
known by the generic name of dynamic programming.) 

We first observe that (13) can be written in the forme 

<P[X(to)] = Xp[X<* - X(t 0 + r)] 

00 

+ T, p[X d - X(fo+ k T )]\ k 

&=2 

= xP[X«-X(/ 0 +t)] + <p[X(/ 0 +t)]|. (15) 



I 

l 


Fig. 3. 

Now let (?°(X(t 0 )) be the value of the performance in- 
dex when the optimal sequence (14) is used. Substitut- 
ing (15) and invoking the principle of optimality, we ob- 
tain a functional equation (13) for <P° ; i.e., 

(P°[X(/ 0 )] = Min (P[X (/<>)] 

M(to) t M(to+r) ,* • • 

= Min X{p[X d - X(t 0 + r)] + (?°[X(to + r)]}. (16) 

M(t 0 ) 

Note carefully that the right-hand side of (16) is a func- 
tion of M(to) and X(to) through (6). The solution of the 
functional equation (16) determines M°(to) as some 
function of X(to) which may be denoted by 

M°(t 0 ) = h(X(k)). (17) 

Now at time tof-r we are confronted by the same deci- 
sion problem as at time to. This shows that (P° [X(£o+r) ] 
also satisfies the functional equation (16), and therefore 
M°(toA-r) is the same function of the state at time tof-r 
as M°(to) was at time t 0 . Thus we have arrived at the 
following result. 

If the performance of a dynamic system governed by (6) 
is optimal in the sense that the performance index (13) is 
a minimum , then the sequence of optimal control variables 
is obtained by observing the state of the system at times 

U . ), to + r, to + 2 r, • • • 

and computing the optimal control variables at each sam- 
pling instant by means of the formula 

M°(to + hr) = h(X(io + hr)), (18) 

h being determined by solving (16). 

Eq. (18) shows that the feedback principle can be in- 
cluded in the framework of dynamic optimization. This 
means that the entire future evolution of the dynamic 
system, including the values of the optimal control 
variables at each sampling instant, could be predicted in 
principle by means of (6) and (18). However, because of 
the inaccurate knowledge of the state-transition func- 
tion, unknown disturbances acting on the process, and 
random effects such as turbulence, etc., the prediction 
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based on (6) will be less and less correct as the predic- 
tion interval increases. By re-measuring the state of the 
system at the sampling instants [assuming that r has 
been chosen small enough so that the one-step predic- 
tion based on (6) is sufficiently accurate], the prediction 
errors are corrected so that the control variables assume 
very nearly their optimal values at all times. This is the 
conventional use of feedback. As is well known, feed- 
back will also tend to minimize the sensitivity of h to 
variations in <£. 

To solve the functional equation (16), it is often con- 
venient to use an iterative procedure. To derive the 
iteration scheme, we replace the performance index (13) 
by 

N 

«v[x(f„)] = E - x(i a + kr) ]x*. (i9) 

l 

In other words, the original infinite-step decision process 
is converted into a finite-step decision process. Proceed- 
ing exactly as in the derivation of (16), we find that the 
successive optimal performance indexes 6>n° are con- 
nected by the recurrence relations: 


termined from (6) by means of the formulas (assuming 
U = const) 

F a — dfi/dXj \x=x*, m=m* (i, j = 1 , • • • , n) 

Da = dfi/dMj |x=x*. m=*m* 

(*= 1» * V •,»;/ = li , 0- (23) 

As is well known, 14 the solution of the differential equa- 
tion (22) has the form: 

x(t ) = $(/ — to) x (to) + f <£(/ — r) Dm(r)dr (24) 

J to 

for any t, to. The matrix $(r) is called the transition 
matrix of the system (22) and is given by 

00 

4>(r) = exp Ft = £ F k i*/k\ (25) 

k= o 

The Taylor series is a convenient way of calculating 
numerical values of <f>(r) when there are a large number 
of state variables and when a digital computer is avail- 
able. There are also analytic ways of computing < I >( t ). 5 ’ 7 


(Pi°[X(/ 0 )] = Min Ap[X d - X(/„ + r)] 
M{t o) 


<JV + i°[X(*o)] = Min \{p[X d - X(to + t)] + dV[X(Z 0 + r)]} 
M(to) 


0 < Mi(to) < Mi. 

f 


( 20 ) 


In each stage of the iteration (20), the optimal control 
signal M°(to) is determined as some function h N of 
X(t 0 ). As N — r oo , it can be shown under various restric- 
tions 6 ’ 13 that fiV° converges to (P°, and h N converges to h. 

III. Dynamic Programming in the Linear Case 

The ease or difficulty of carrying out iterations (20) 
is determined largely by the complexity of the dynamics 
of the reaction and by the limits imposed on the control 
variables. To illustrate these computations concretely, 
we consider now the very special (but practically im- 
portant) linear case where 

1) The reaction dynamics are governed by an ordinary 
linear differential equation with constant coefficients 

2) There are no amplitude constraints on the control 
variables. 

Linear differential equations arise when the dynamic 
equations (6) are linearized about some equilibrium 
state X * and the corresponding values of the control 
variables M*. If we let 

X = X* + x, M = M * + m, and X d = X* + x d , (21) 

and, if the deviations x, m from the equilibrium values 
are sufficiently small, (6) leads to the linear differential 
equation with constant coefficients 

dx/dt = Fx -p Dm, (22) 

where F is a constant nXn matrix and D is a constant 
nXl matrix. The elements of these matrices are de- 


When m(t) is constant during the intervals between 
sampling instants, (24) takes the simpler form 

x(to + r) = 4>(t) X (to) + A(t)m(to), (26) 

where 

A (t) = ^ 4>(r — a)Ddo. (27) 

J o 

Eq. (26) is the explicit form of (6) in the linear case. 

We now give a formal derivation of the explicit equa- 
tions for accomplishing the iterations indicated by (20). 
The yarious formal steps of the derivation can be justi- 
fied under mild mathematical restrictions. 6 

If p is given by (8) and (P by (13), it can be shown by 
induction that the optimal performance index may be 
written in the form 

GV[x(/o)] = x' (to) P Nx(to) — 2x'(to)RNX d 

+ x d 'S N x d (N > 0) (28) 

P N , R n , Sn being nXn, nXl, and 1X1 matrices, respec- 
tively, and 

Po = Ro = So = 0 . 

For simplicity, we now drop the arguments of <L(r) 
and A(t). Using (20) and (26), we calculate the deriva- 

14 E. A. Coddington and N. Levinson, “Theory of Ordinary Dif- 
ferential Equations,” McGraw-Hill Book Co., Inc., New York, 
N. Y„ ch. 3; 1955. 
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tive of the scalar 6 V +1 [x(^ 0 ) ] with respect to the vector 
m(to). This is a vector (with components d6V+i/dra;(U)) , 
which is given by: 

d(PiV+i/dw(/o) = — 2 [A'(i?jv T Q)x d — A'(Pjst T Q) ($x(t 0 ) 
+ A m(t 0 ))] (N > 0). (29) 

Now (Pv+i[^(^o)] is evidently a quadratic function of 
each of the incremental control variables mftf). It fol- 
lows that (Pat+i has a single extremal value [which may 
be a minimum or a maximum, depending on the value of 
x(to) ] at that value of m(t 0 ) which makes the right-hand 
side of (29) zero. It can be shown 6 that the extremal 
value is a minimum for every x(t 0 ). Hence, m(t 0 ) is 
found by setting (29) equal to zero, which yields the fol- 
lowing expressions for m°(t 0 ) and the matrices defining 
( Pjv+i : 

m°(t 0 ) = — AnxQo) + B N x d , (30) 

where 

An = [A'(.fV + 0A]-‘(P* + 

Bn = [A ’(P N + Q)A]~'(Rn + 0 / ' 

With some further calculations, using (30) and (31) we 
find that: 


The objective is to convert raw materials A and B by 
means of reaction (i) into C obtaining as much quantity 
of C as possible. The optimization of the processus 
complicated by the undesired side reaction (ii) which 
produces the contamination product D. Under steady- 
state conditions, the resulting concentrations in the out- 
flow as a function of the “hold-up 7 ’ time (reactor volume 
/outflow rate) will have the qualitative shape shown in 
Fig. 4. 



Bn+i — X(4> — AAn)'(Pn + 04>; (32a) 

Rn+1 = X(4> - AA N )'(R N + Q)- (32b) 

and 

Sn+i = \(S N + Q — Bn'A'(Rn + Q))- (32c) 

The iterations indicated by (32) can be readily per- 
formed on a digital computer. Note that S# need not be- 
computed if only the optimal control vectors are of in- 
terest. In the limit o, all quantities in (32), except 
Sn+ i, may be shown to converge under certain restric- 
tions on F, D, and X . 6 

We get by inspection of (30) the important result: In 
the linear case, the optimal control variables are linear func- 
tions of the actual and desired states of the reactor. 

Since the control variables are linear functions of the 
state variables, it follows that under closed-loop control 
the reactor is a linear dynamic system. It can be shown 6 
that the only possible type of limiting behavior in such 
systems as t — » <» is for the state X (t) to converge to an 
equilibrium state X*. Since dynamic optimization in- 
cludes static optimization, it follows at once that: In the 
linear case, the states of a dynamically optimized system 
tend asymptotically to the same equilibrium state X*° 
which is obtained under static optimization. 

Example: As a numerical illustration of the results 
obtained by the use of dynamic programming in the 
linear case, let us consider the following hypothetical 
reactions: 


We now derive the analytical form of the dynamic 
equations of the reactor using the assumption that the 
Arrhenius equation ( 1 ) holds. Denoting the concentra- 
tions of A, • • • , D by Xi, • • • , Xi, and the flow rates 
of A and B by M\, M 2 we find, using conservation of 
mass, that: 

dXr/dt = - ki(T)XiXt + (Mi/V)Th 

- [(Mi + Mf)/V\X 1 ; 

dX t /dt = - kx(T)XiX 2 - 2 k 2 (T)X 2 2 X s + ( M 2 /V)U 2 

- [(Mi + M 2 )/V}X2\ 
dXz/dt = ki (T)X iX 2 ~ k2(T)X 2 ‘ 2 X 3 

- [(Mi + Tf 2 )/F]X 3 ; (33) 

and 

dXt/dt = 2ki(T)X2 2 Xz - ([Mi + M 2 )/V]X i . 

Let Ti and T 2 denote the temperatures of the input flows 
Mi, M 2 ; let T c be the average cooling water temperature 
inside the cooling coils of the reactor; and let h be the 
corresponding average heat transfer coefficient per unit 
cooling water flow. Furthermore, let Hi be the heat 
generated per molecule of the first reaction ; H 2 the heat 
generated per molecule of the second reaction; p the 
average density of the material in the reactor; and c 
the average heat capacity of the material. Denoting the 
temperature in the reactor by X 5 and the cooling- water 
flow rate by M 5 , conservation of energy yields 


(i) 


kfT) 

A + B — -4 C 


HP) 


* 2D. 


dX h /dt = ki(T)XiX 2 Hi + ^ 2 (r)X 2 2 X 3 F 2 

+ (. Mi/Vpc)(T 1 - X 6 ) + ( M 2 /Vpc)(T 2 - x 6 ) 

+ (h/Vpc)M*(T e - X 5 ). (34) 


(ii) 2B + C 



114 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


At equilibrium, the variables entering (33) and (34) 
are assumed to have the values shown in Table I. Note 
that the first reaction is assumed to be exothermic, and 
the second is assumed to be endothermic. 

Using these values, (23) yields the following numerical 
values for the matrices describing the dynamics of the 
reactor in the vicinity of equilibrium: 


" — 0.325 

-0.5625 

0 

0 

-0.200" 

-0.225 

-0.8125 

-0.014286 

0 

-0.368 

0.225 

0.4875 

-0.107143 

0 

0.116 

0 

0.1500 

0.014286 

-0.1 

0.168 

_ 0.450 

0.7500 

-0.035714 

0 

— 0.060_ 


Assuming that only the flow rates M\ and M s can be 
changed to effect control, we get: 


D = 


" 55 0 0 “ 

- 4 0 0 

-21 0 0 

- 3 0 0 

2 0 35. 5_ 


(36) 


Using a sampling period r = l, the transition matrix 
for the linear system can be obtained using the Taylor 
series (25). 


TABLE I 

Values of Reactor Constants 


Ui U t 

Mp 

Mp Mp 

X,* X 2 * 

x 8 * x 4 * x 8 * 

65 59 

0.05 

0.05 0.1 

10 

4 

21 3 120 

h(Xp) 


kJXp) 

dki(XJ) /dX s 

, h(Xp)/dX 5 

0.05625 


0.00044643 

0.005 

0.00025 

Ti 

r 2 

T c Hr 

Ho. 

V 

h/Vpc pc 

100 

100 

49 2 

-5 

1 

0.5 10 


To improve the operation of the reactor, it is desirable 
to increase the yield of C and cut down the yield of D. 
Therefore, the desired state of the reactor may be de- 
fined as: 


xP — 5 .and xp = — 1. (41) 

If the reactor starts out at the old equilibrium state 
(xi = X 2 = ■ • • =X 5 = mi= • • • = 7^3 = 0) at time t o = 0, 
the behavior of the state and control variables as a 
function of time will be as shown in Fig. 5. It is evident 
from Fig. 5 that it is possible to achieve almost exactly 
the new desired state and that the new equilibrium 
state can be reached rather quickly. It should be noted, 
however, that the results are valid only if the linearized 
approximation of the dynamics is valid. 


" 0.7407 

-0.3581 0.00498 

0 

-0.0909“ 

-0.1793 

0.4095 -0.00448 

0 

-0.2197 

0.1566 

0.2895 0.89510 

0 

0.0264 

0.0151 

0.1373 0.00946 

0.9048 

0.1288 

_ 0.3004 

0.3872 -0.03460 

0 

0.8172_ 

" 48.216 

0 -2.169 " 




- 7.695 

0 -4.682 




-15.744 

0 0.9124 




- 3.089 

0 2.5132 




7.056 

0 32.822 _ 





(37) 


(38) 


The peformance index is defined as: 

oo 

(P = 2 l x * d ~ 3s(/o + &)] 2 + [%P ~ *4(/o + £)] 2 - (39) 

*= i 

The optimal control variables for this performance 
index are given by the following functions of the desired 
and actual state of the reactor: 


IV. Limitations of the Linearity Assumption 

There are a large number of problems which must be 
considered before fully automatic dynamic optimization 
of chemical reactions can take place. 

1) If state variables are not physically measurable, 
they must be generated artificially in order to be able 
to compute the optimal values of the control variables. 



"0 

0 

-0.0680 

0.0217 

0“ 


"0.0102 

0.0164 

0.0605 

-0.0197 

-0.0012" 

m = 

0 

0 

0 

0 

0 

x d + 

0 

0 

0 

0 

0 


_0 

0 

0.0788 

0.3932 

0 _ 


_0 . 0064 

-0.0327 

0.0668 

-0.3537 

— 0.0504_ 
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SECONDS 


Fig. 5. 


This calls for simulating some of the reaction dynamics 
as an integral part of the control system. This, too, can 
be done by means of a digital computer. (Most of the 
analog-type control instruments used at present may be 
thought of as performing essentially this function.) 

2) The dynamic programming equations can be solved, 
practically speaking, only in the linear case. In reality, 
of course, the reaction dynamics are nonlinear. More- 
over, they may change with time due to uncontrollable 
or unknown effects. There are essentially two possibili- 
ties of attacking these problems. 

a) The reaction dynamics are linearized over a certain 
region in state space. The reaction is then optimized on a 
linear basis, computing the dynamic programming equa- 
tions in real time. If, as a result of this optimization, the 
state moves into another region of the state space, an- 
other set of linearized equations is obtained to describe 
the dynamics in the new region. These equations are 
then used to obtain a new dynamic optimization, etc. 
This method of attack is closely related to the problem 
of designing adaptive or self-optimizing systems 15 about 
which little is known at present. The chief difficulty is 


the rapid and accurate determination of the linear dy- 
namics in the presence of measurement noise. 

b) The dynamic optimization is solved directly by 
purely numerical methods. The chief difficulty encoun- 
tered here is the experimental measurement and rep- 
resentation of the reaction dynamics in a nonlinear form. 
Very little is known about this problem at present. 

c) The control variables cannot be chosen freely but 
must lie within certain prescribed ranges; in other 
words, the control variables “saturate.” The problem of 
designing a control system where the dynamic equations 
of the control object are linear but where the control 
variables saturate has an extensive literature usually 
under the subject heading of “Optimal Relay Servo 
Problem.” At present this problem is solved only in the 
case where 1) the dynamic equations are of the second 
order and 2) there is only one control variable. 16 Using 
the point of view of this paper, a rigorous method was 
recently obtained (which is not subject to the above re- 
strictions, 1 and 2) 8 for the computation of the opti- 
mal control variables; however, this method is very in- 
efficient. When the control object has nonlinear dy- 
namics, no method of computing the optimal control 
variables is known. 

Despite these obstacles, much progress can be ex- 
pected from the utilization of the “state” method of 
describing reaction dynamics combined with dynamic 
optimization as presented in this paper. These new ideas 
will probably be most helpful in attempting to control 
(by means of real-time digital computation) dynamic 
systems which have many state variables. 

List of Principal Quantities 
Sections II-A and II-B 

U ; Ui = vector denoting concentrations in input 
streams; its components. 

M\ Mi = control vector; control variables. 

1 = number of control variables. 

T = temperature. 

X; X{ = state vector; state variables (concentrations 
and temperature inside reactor). 
n = number of state variables. 
t\ £o = time; initial time. 

Section II-C 

f; fi = infinitesimal state transition function; its 
components. 
r = sampling period. 

0; </>i = (finite-interval) state transition function; its 
components. 


15 R. E. Kalman, “Design of a self-optimizing system,*” Trans. 
AS ME, vol. 80, pp. 468-478; 1958. 


16 R. E. Kalman, “Analysis and design principles of second and 
higher-order saturating servomechanisms,” App. 2, Trans. AIEE, 
vol. 24, pt. 2, pp. 294-310; 1955. 
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Section II-D 

p = distance function in state space (pseudo- 
metric) . 

' = transpose of the matrix. 

Q = positive semidefinite matrix. 

( )* = equilibrium values. 

( )° = optimal values. 


Section II-E 

(P = performance index. 
h] hi — optimal control function. 


Section III. 

x; Xi = incremental state vector; incremental state 
variables. 

m; mi — incremental control vector; incremental con- 
trol variables. 

F= infinitesimal transition matrix in linear case. 

D = matrix denoting instantaneous effect of con- 
trol variables in linear case. 

<£(r) = (finite-interval) transition matrix in linear 
case. 

A (r) = matrix denoting effect of control variables 
in linear case (finite-interval). 

An, Bn, Pn, Rn, Sn constant matrices. 


Simulation of Human Problem-Solving 

W. G. BOURICIUSf and J. M. KELLERf 


S IMULATING human problem-solving on a digital 
computer looks deceptively simple. All one must 
do is program computers to solve problems in such 
a manner that the computer employs the identical 
strategies and tactics that humans do. This will prob- 
ably prove to be as simple in theory and as hard in 
actual practice as was the development of reliable 
digital computers. One of the purposes of this paper is 
to describe a few of the pitfalls that seem to lie in the 
path of anyone trying to program machines to “think.” 

The first pitfall lies in the choice of an experimental 
problem. Naturally enough the problem chosen should 
be of the appropriate degree of difficulty, not so difficult 
that it cannot be done, and not so trivial that nothing 
is learned. It should also involve symbology and manip- 
ulations capable of being handled by digital computers. 
At this stage of problem consideration, a devious form 
of reasoning begins to operate. Usually the people en- 
gaged in this type of research will have had a thorough 
grounding in conventional problem-solving on com- 
puters. Consequently, they are conversant with the full 
range of capabilities of computers and have an apprecia- 
tion of their great speed, reliability, etc. They also know 
what kinds of manipulations computers do well, and 
conversely, what kinds of things computers do in a 
clumsy fashion. All of this hard-earned knowledge and 
sophistication will tend to lead them astray when the 
time arrives to choose a problem. They will try to make 
use of this knowledge and hence choose a problem that 
will probably involve the simulation of humans solving 
problems with the aid of computers rather than the 

f IBM Res. Center, Yorktown Heights, N. Y. 


simulation of humans solving problems with only paper 
and pencil. Consequently, the characteristics of present- 
day computers may confine and constrict the area of 
research much more than is desirable or requisite. What 
is liable to happen, and what did happen to us, is that 
the experimental problem chosen will develop into one 
of large size and scope. If this always happens, then 
those human manipulative abilities that are presently 
clumsy and time-consuming on computers will never 
get programmed, simulated, or investigated. For- 
tunately for us, the two experimental problems we 
chose were of such a nature that they could be easily 
miniaturized, and this was done as soon as the desira- 
bility became apparent. 

The second pitfall which must be avoided is the as- 
sumption that one knows in detail how one thinks. This 
delusion is brought about by the following happen- 
stance. People customarily think at various levels of 
abstraction, and only rarely descend to the abstraction 
level of computer language. In fact, it seems that a 
large share of thinking is carried on by the equivalent 
of “subroutines” which normally operate on the sub- 
conscious level. It requires a good deal of introspection 
over a long period of time in order to dredge up these 
subroutines and simulate them. We believe people as- 
sume that they know the logical steps they pursue when 
solving problems, primarily because of the fact that 
when two humans communicate, they do not need to 
descend to the lower levels of abstraction in order to ex- 
plain to each other in a perfectly satisfactory way how 
they themselves solved a particular problem. The fact 
that they are likely to have very similar “subroutines” 
is obvious and also very pertinent. 
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The third pitfall consists of the following fact: any 
problem chosen as an experimental vehicle is likely to 
produce interesting results if the program is successful. 
We consider these results to be byproducts of the gen- 
eral problem of studying the methodology of problem- 
solving, though they do serve as a test for success of the 
methods employed. As these byproducts accumulate, 
one is increasingly tempted to spend a disproportionate 
amount of time on obtaining useful and/or interesting 
byproducts. This diverts the researchers and they make 
little progress along the lines originally intended. 

The class of problems we chose was the following: 
given a set of elements, and a criterion of compatibility 
between any two of its elements, find a subset of mu- 
tually compatible elements satisfying given constraints. 
This covers a large class of problems to which no satis-" 
factory analytic solutions are known. An example 
would be: given the set of all airplanes near an airport, 
find the largest subset of those which are on compatible 
(i.e., noncollision) courses. 

The first experimental problem we chose consisted of 
this: a word list was given together with a table of 
synonyms for each word. The test of compatibility be- 
tween any two words on the list was provided by a 
speech-recognition machine which was not as discrim- 
inating as the human ear. The problem was to find a 
word list the same length as the original list, but with 
synonyms substituted wherever necessary so that all 
the words on the resultant list could be unambiguously 
identified by the speech-recognition machine. The diffi- 
culty encountered in substituting the synonyms is that 
each substitution may cause added incompatibility rela- 
tionships. The final list of mutually compatible words 
would be a practicable working vocabulary for voice 
control of, say, an air defense center. 

We decided that the following three heuristic methods 
would probably be used by humans. These are: 

Method 1— test the first word against all the follow- 
ing words. Wherever two words are incompatible, sub- 
stitute synonyms for the second word until a compatible 
synonym is found, then proceed. Repeat with the second 
word of the list. After the last two words are tested, 
reiterate. Eventually a word list meeting the compati- 
bility criteria will be found, or else the tables of syn- 
onyms will be exhausted. 

Method 2 — this method is basically the same as 
method 1, with this added sophistication: whenever 
a compatible synonym is substituted, it is immediately 
tested further for compatibility with all words in the 
list occurring before the particular two words concerned. 

Method 3 — determine which words are incompatible 
with the largest number of other words in the list and 
substitute synonyms for these highly incompatible 
words first. Then reiterate. 

Two submethods also suggest themselves as processes 
to apply prior to trying the above methods. These are: 

Submethod 1 — sort the words on the first phoneme. 


Submethod 2 — try a batch procedure: divide the 
original list into two or three parts, apply one of 
the three main methods to each in turn, then put 
the batches back together before trying the final 
manipulation. 

All of these methods were programmed and put to- 
gether in a master program which determined the se- 
quence of the application of each of the methods and 
submethods. Quite naturally, this sequence was: first, 
the “quick and dirty” method 1 ; then, the more sophis- 
ticated method 2; and last, the more complicated and 
most powerful method 3. At the start, each of the meth- 
ods was allotted a certain amount of time, and if no 
compatible word list was produced within the allotted 
time, then the master program switched to the next 
method. This procedure was modified so that the time 
was extended if an analysis showed that the method had 
a good chance of successfully producing a compatible 
word list. To determine which submethod to employ 
first, a random choice was made, and lack of success 
automatically switched control to the other submethod. 

To simulate these methods we employed a random 
number generator to generate 18-bit pseudo words, each 
consisting of three 6-bit pseudo phonemes. Inasmuch as 
the English language contains approximately 43 
phonemes, 6-bit pseudo phonemes can reasonably be 
expected to represent adequately human phonemes. The 
human ear was considered capable of distinguishing be- 
tween any two 6-bit pseudo phonemes that differed in 
one or more bit positions. The hypothetical speech-rec- 
ognition machine, being not so discriminating, was con- 
sidered capable of differentiating between any two 6- 
bit pseudo phonemes if and only if they differed in two 
or more bit positions. Mathematically stated, two 
pseudo phonemes Pi and Pj are considered compatible 
whenever 

W(P t +. Pf) > 1 

where the weight function, W, merely counts the num- 
ber of ones in the argument, and the operation +. be- 
tween the two pseudo phonemes is bit-by-bit addition 
modulo two, which is equivalent to exclusive OR. 

Our experimental byproduct results are given in 
Table I, where the “word-list length” is defined as being 
the length of word lists that have a 50 per cent chance 
of being satisfactorily manipulated. 

The second experimental problem consisted of finding 
the largest mutually compatible set of 12-bit numbers 
satisfying the following criterion of compatibility: 

W(A t +.Aj) > 5. 

This set will have the characteristics that double error 
detection and correction is possible when employing it 
as an information transmission code. 1 

1 R. W. Hamming, Bell Sys. Tech. J., vol. 29, pp. 147-160; 
April, 1950. 
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TABLE I 

Effectiveness of Several Methods of Finding Compatible 
Word Lists 


Method 

Word- List Length 

Average Computing 
Time in Minutes 

1 alone 

325 

0.65 

1+sub 1 

300 

0.74 

1-j-sub 2 

325 

0.78 

2 alone 

550 

3.70 

2+sub 1 

525 

2.55 

2-j-sub 2 

550 

2.95 

3 alone 

740 

3.82 

3+sub 1 

740 

3.94 

3-j-sub 2 

740 

4.50 

Master Program 

750 + 

Dependent on 

Word List Length 


While in general the resulting code is nonconstruc- 
tive, in this paper we confine ourselves to a subgoal, 
namely, the results obtained when constraining the set 
to be a constructive group code. With this constraint, 
the original compatibility criterion of W{Ai~\~ .A,)>2E 
+ 1, where E is the number of errors to be detected and 
corrected, can be reduced to the following set of con- 
straints on binary numbers whose lengths are those of 
the check bits only; the so-called parity check matrix: 2 

W(Ci) > 2E - 1 
W{Ci + . Cj) >2E- 2 
W(Ci+.Ci+.C k ) >2E-3 


W(C i +.C j +.C k + • • • • C n ) > 0 . 

The size of the set of C’s determines the maximum 
number of information bits that can be handled by 
check bits whose formulas for construction are given by 
the columns of the C’s. These constraints were pro- 
grammed and the size of the matrix determined by ex- 
haustion. 

The experimental byproducts are given in Table II. 

Having satisfied your curiosity regarding the possi- 
bility of obtaining useful and/or interesting results 
from these methods, we now wish to discuss those as- 
pects that are pertinent to the simulation on computing 
machines of human problem-solving strategies and 
techniques. 

In the beginning we had what we thought were very 
good ideas of how to proceed. We had solved all of the 
problems on a medium level of abstraction. We soon 
learned, however, that an abstraction is nearly always 
a refuge from accurate knowledge. Explaining to each 
other how to reach a particular goal did not teach us 
how to program the simulation of the manipulations in- 

2 D. Slepian, Bell Sys. Tech. /., vol. 35, pp. 203-234; June, 
1956. 


TABLE II 

Double Error-Correction Code Lengths 


Number of Check Bits 

Number of Information Bits 

6 

2 

7 

4 

8 

> 9 

9 

>13 

10 

>16 


volved until we descended the abstraction ladder to the 
level of the symbolic coding process we were employing. 
For example in submethod 1, sorting the words accord- 
ing to the size of the leading 6-bit phoneme is useless. 
For detecting 1-bit differences it is a useful procedure, 
but for detecting 2-bit differences, one must order the 
words in an order that bears a useful relationship to the 
compatibility criterion. In our first problem we found 
this relationship in the weight of the phonemes, and 
accordingly arranged the words in order of increasing 
weight of the first phoneme. The result was a quicker 
finding and weeding out of incompatible words in iso- 
lated cases. 

A more subtle difficulty arose in the second of our 
problems. Consider the following five beginning mem- 
bers of a larger set satisfying the constraints previously 
described : 

1) 11 11 00 00 

2) 00 11 11 00 

3) 00 00 11 11 

4) 11 00 00 11 

5) 10 10 10 10. 

If element number 5 was tried and discarded because 
it was found not to yield a sufficiently large set of C’s, 
then the number 01 01 01 01 need not be tried because 
it will yield similar unsatisfactory results. When doing 
this by hand on a piece of paper, one can immediately 
detect the equivalence between the two trial numbers 
for the fifth element. The criterion for this equivalence 
is that the two sets of five numbers containing different 
fifth elements are identical upon column permutation. 
It took us literally days to describe this simple relation- 
ship to the 704 computer whereas it takes only a minute 
to describe it to you. One of the reasons, of course, is 
that this concept involves two-dimensional visualiza- 
tion at which humans are particularly good and 704’s 
are particularly inept. As a result, we wasted a lot of 
time trying to circumvent the transposition of the 
matrix elements in order to avoid using an excessive 
amount of machine time. A second-level difficulty arose 
when it was realized that all of the possible equivalences 
were not discovered by the original code, which did not 
contain column transposition. Consider, for example, 
the situation when the fifth member is 10 10 11 00. Then 
the number 10 10 00 11 is equivalent, but an involved 
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column permutation had to be programmed in order to 
detect such an equivalence. 

In conclusion, we would like to give our motivation in 
trying to simulate human behavior. At the present 
time, there are many problems whose nature is such 
that machines do not handle them successfully even 
though they are regularly solved by humans. 3 There- 
fore, it seemed desirable for us to study a “working 
model” in action, to determine its component functions 
and to analyze each function to find out what part it 
played in the solution of these difficult problems. Our 
main goal is to be able to solve these kinds of problems, 
partly by new and different programs, and partly by 

3 A. Newell and H. A. Simon, “Current Developments in Complex 
Information Processing,” RAND Rept. ; May 1, 1956. 

H. L. Gelernterand N. Rochester, “Intelligent behavior in prob- 
lem-solving machines,” IBM J. Res. Dev., vol. 2, pp. 336-345; 
October, 1958. 


new and different machine-instruction sets. We do not 
insist that the machines do all the work, because we are 
convinced that a combination of a human and a ma- 
chine working in tandem will always have superior prob- 
lem-solving powers than either working alone. To com- 
plement each other, better communication is required 
between humans and machines, and this implies com- 
munication on higher abstract levels than the ones now 
employed. In order for this to be possible the machines 
must have either wired-in or programmed “subroutines” 
to interpret the directions and/or suggestions given 
them by their human colleagues. These “subroutines” 
should overlap those of humans in order to be useful. 
We feel that this goal will be furthered by continued re- 
search along the lines described in this paper. 

The authors would like to express their sincere thanks 
to N. Rochester for suggesting the problems and out- 
lining the heuristic programming methods utilized. 


The Role of the University in Computers, Data 
Processing, and Related Fields* 

LOUIS FEINf 


Introduction 

S INCE the Fall of 1956, the author has been study- 
ing the genesis and operation of university pro- 
grams in the fields of computers, data processing, 
operations research, and other relatively new and ap- 
parently closely related fields. The specific purposes 
were : 

1) To study and evaluate the organization, curricu- 
lum, research program, computing equipment, 
financing, and facilities of universities in the 
United States having computer and/or data 
processing and/or related programs. 

2) To identify those fields of study (some already ac- 
cepted and identified as disciplines as well as those 
not yet so designated) that are unambiguously 
part of the computer and data processing fields 
and those closely related fields that might legiti- 
mately be part of a university program. 

3) To appraise the role of the universities in these 
fields and to determine what universities might 
do to build distinguished programs in these fields. 

* Part of this work was undertaken when the author was a con- 
sultant to Stanford University, 
t Consultant, Palo Alto, Calif. 


During the course of the study it became clear that a 
university program in any field is an important function 
of the role of the university in society itself. The identi- 
fication of this role thus became a fourth separately 
identifiable purpose of the study. 

Source information was obtained by formal inter- 
views and informal sessions with university administra- 
tors, directors of computing centers, faculty members, 
students, industrial representatives, and other inter- 
ested persons. Places of interview were universities, 
scientific meetings, social gatherings, industrial plants, 
and research institutes. Important information was ob- 
tained from a few publications. 

A questionnaire intended for mailing was considered 
and dropped because it became clear very early that 
only personal interviews could bring out the important 
facts and opinions. Hence, the conclusions and recom- 
mendations are not always derived from a mass of ac- 
cumulated data. They do reflect the author’s experience 
in the computer and data processing field, information 
about what universities are doing and especially what 
they are not doing, and the influence of those individuals 
interviewed who have experienced and reflected seri- 
ously on the same problems as those considered in this 
study. 
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Study and Evaluation 
Modern U. S. Culture 

In response to the increasingly large and complex 
problems faced by society, our culture has changed in 
several important respects. For their solution, these new 
problems require new techniques and sometimes refine- 
ment of already developed techniques. The require- 
ments for . training people in new fields (and in older 
ones) and for research, design, and development are 
difficult for us to cope with. In fact, we have, as a na- 
tion, sought solutions to our problems in training, edu- 
cation, research, design, and development, from what- 
ever institutions or individuals would make themselves 
available. On this account, traditional roles of institu- 
tions in our society are changing, and in many cases 
transformations are almost complete. 

In the past, industry, government, and the university 
were each roughly assigned specific fields and subject 
matter in which they were active almost exclusively. 
Research and instruction in the “intellectual disciplines” 
were the province of the university, and design and 
development belonged mainly to industry and govern- 
ment. Today, one can sometimes scarcely identify 
teachers, researchers, designers, and development peo- 
ple as members of a university, industry, or govern- 
ment, for each is involved to a considerable extent with 
all three. Nor can they be identified by their (over- 
lapping) fields of interests. For example, an information 
theorist may just as well be found at Bell Telephone 
Laboratories as at the Massachusetts Institute of 
Technology. 

Specifically, the milieu of our society is characterized 
by a demand that more phases of our traditional culture 
be subjected to a critique guided by rationalistic and 
scientific rules of evidence. Thus, 

Information-Data — gathering, storage, searching, 
classification, cataloguing, retrieval, encoding, de- 
coding, interpretation, sampling, filtering, analyzing, 
checking; 

Models — mathematical, Boolean algebraic, Post alge- 
braic, logical, stochastic, computational, statistical; 

simulators, computers, automation, language trans- 
lation, switching theory, information theory, coding 
theory, cybernetics, decision theory, statistics, opera- 
tions research, econometrics, psychometrics, manage- 
ment science, linear programming, game theory, au- 
tomata theory, artificial intelligence, self-organizing 
machines, adaptive mechanisms, neural psychology, 
learning machines, and numerical analyses 

are examples of topics — some old, some new — that are 
having a terrific impact on and general acceptance by 
the industrial, government, and university community. 

This acceptance is due in no small measure to the 
spectacular successes enjoyed by some of these fields 
when applied to the solutions of problems in many of 


the crash programs that characterize the present genera- 
tion. The following excerpt from the April, 1956 issue 
of the Journal of the Institute of Management Science 
characterizes a phase of the milieu: 

Primarily, what are needed are methods for grappling with multi- 
variable situations as they occur in the world rather than the single- 
variable methods of classic laboratory science. Hopefully, it should 
be possible to start with qualitative or rough quantitative approaches 
and gradually develop tools of greater precision. This suggests the 
use of mathematical models in both their qualitative and quantitative 
aspects. Techniques grounded in mathematics, notably including 
linear programming, search theory, and game theory (particularly 
non-zero sum multi-person games) have outstanding potential con- 
tributions. Marketing budgets involve commitments on too large a 
scale to permit executives to continue to accept historical patterns 
in a management world in which rational explanations of alternative 
strategies and planned optimizing are becoming standard operating 
procedures. 

Industry, business, and government have varied re- 
quirements for trained people to do the work requiring 
talents in these fields. Need for investigating these fields 
from the scholar’s viewpoint has also been recognized 
although support for this endeavor lags. The problems 
of coping with practical problems in and with these 
fields, training professionals, training scholars, and do- 
ing research, have been handled in a variety of ways by 
industry, government, and the university when indeed 
they have tried to handle the problems at all. On the 
whole, each of these three important segments of the 
community has been unprepared by virtue of its tradi- 
tional organization, philosophy, responsibilities, and 
procedures to incorporate and cope with these new situ- 
ations — situations whose fundamental implications 
usually require a modification of their organizations, 
philosophies, responsibilities, and procedures. 

Thus, the government has set up RAND Corpora- 
tion (s) for research and project work; it has set up and 
supported institutes at universities for research and 
project work; it has indirectly supported development, 
if not exploratory work, it these fields as byproducts 
of government contracts to industry. 

Industry and business have set up separate depart- 
ments charged with over-all responsibility of applying 
these techniques to company operation, management, 
design, manufacturing, sales, etc. Graduate-level schools 
for instruction of professionals in these fields are being 
run by industry itself. 

The scholars and practitioners in these new fields are 
uncertain both as to the nature and structure of the 
fields and their relation to each other. As would be ex- 
pected, new societies and magazines devoted to these 
fields have sprung up. The following quotes (from 
Transactions of the Institute for Management Sciences) 
bespeaks the growing pains of the community of practi- 
tioners and scholars in these youthful fields: 

The writer has a strong feeling that those who worked so hard and 
successfully to organize TIMS were on the right track. However, it 
appears that they were led more by broad feelings than by logical 
reasoning. They knew (in the sense of having faith) that there was a 
need for something not yet in existence but they failed to develop a 
clear and distinct picture of what thev intended to accomplish. 
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Further, the “Statement of Editorial 'Policy” from the 
September, 1957 issue of the Journal of Information 
and Control, 

The theories of communication, computers, and automatic control 
were initially of interest primarily to mathematicians and engineers. 
The papers relating to these fields have therefore appeared in a wide 
variety of engineering and mathematical journals. Some of the results 
of this work have been applied by scholars in such areas as linguistics, 
psychology, statistics, physics, genetics, neurophysiology, and even 
philosophy. Such applications have led to results which are of interest 
in their own areas, but which in some instances have also suggested 
models and raised questions of fundamental interest to the theories 
being applied. Unfortunately, they appear in many entirely different 
sets of professional publications, which few readers of engineering 
and mathematical journals see. 

It is the purpose of this new journal, Information and Control, to 
publish papers which make significant contributions to the theories 
of communication, computers, and automatic control and also papers 
which present experimental evidence or theoretical results bearing on 
the use of ideas from such theories in any field to which the ideas are 
relevant. Papers will be published, for example, on such topics as: 

Theory of communication 

Theory of automata 

Theory of automatic control systems 

Description and analysis of language and other natural informa- 
tion sources 

Communications and control systems which may include links 
within or between organisms 

Information aspects of physics and of the theory of observation 
and measurement 

Organization, processing and retrieval of data. 

Any statement of policy for a new enterprise is, at best, an educated 
guess, and this one is no exception. It is, of course, subject to gradual 
change without formal notice, as the Journal takes shape under the 
diverse pressures of authors, editorial board, editors, and readers. 

The Role of the University 

The universities, as institutions, are having a hard 
time learning to cope with their new role in society in 
general and in particular learning how to effectively in- 
corporate these new fields into the academic structure. 

Policy 

Probably 150 universities and colleges are engaged in 
some kind of activity in the fields of our concern. The 
purpose of the schools that first got involved was “to 
get their feet wet” in fields that seemed to have appeal. 
Usually, one man who had a strong interest in this or 
that field would give a course in whatever department 
he happened to be in. Several institutions have em- 
barked on a program of building large-scale digital com- 
puters. Some were successful. Some are now being built. 
Later, as the pressures for information in these fields 
grew, courses were added, some equipment was obtained, 
centers and institutes were established. Only a few uni- 
versities have made a determined effort to select a field 
of interest, set up a policy and goal, and implement it. 
Most were feeling their way. Most are now feeling their 
way. 

The most important impact on university programs 
in these areas has been the educational program of 
IBM. IBM has a manpower problem now; they know 
it will be severe in ten years. Their problem is two-fold. 
They need professionally trained people to help sell 
their product. They want their customers to have pro- 


fessionally trained people to use their product properly. 
IBM has “presented” 650’s to over 50 universities by 
now under the condition (among others) that a couple 
of courses in data processing and numerical analysis be 
given. A 709 has been “given” to U.C.L.A. for “empha- 
sis on the study of business management problems.” 
M.I.T. has been “given” a 704 for “research and educa- 
tion of students in computing techniques.” The Uni- 
versity of California has a retread 701. Sperry Rand, 
Burroughs, Bendix, Royal McBee, and a handful of 
other computer manufacturers have also “contributed” 
computers to universities (see the Appendix). 

It is fair to say that, in many cases, to the extent that 
a university computer activity has a purpose at all, it 
has been made for them by IBM. It is true that for 
many universities, this is good. Otherwise, they may 
never have developed a program at all. Nevertheless, 
there are no distinguished academic centers of com- 
puters, data processing, and related fields, and I believe 
that this is so because not enough attention has yet been 
given to the development of an integrated program and 
policy in response to the needs and conditions of the 
whole community rather than as a supplement to com- 
puters obtained at a “bargain.” 

The scramble to get in on a “free” 650 computer from 
IBM is a disgrace in some cases. Course titles and con- 
tents have been created on the spur of the moment to 
fit the IBM requirements. Faculty have been assigned 
on the basis of their not having a full teaching load — 
more evidence of what is done without a clear, well- 
defined policy and program. 

Organization 

The university entity most popular as the center of 
activity for computers, data processing, operations re- 
search, industrial engineering, mathematics, business, 
etc., is the computing service center. Sometimes an 
interdepartmental committee is in charge of the service 
and of a few courses. Sometimes, all courses are given 
in one department without any necessary relation be- 
tween course content and department. Some schools run 
one-week seminars. Others run symposia. In many 
cases, institutes, not altogether part of the academic 
structure, are involved. 

Faculty 

The range and variety of faculty participating in uni- 
versity computer and data processing programs com- 
pares favorably to the range and variety of the pro- 
grams themselves. Everyone from novices to people 
with ten years of experience is participating. Part-time 
instructors from industry and government have been 
used. Interest without regard to experience and com- 
petence is sometimes the main requisite of the teacher 
in many instances. With the exception of some instruc- 
tion in programming, courses designed for faculty 
training hardly exist. 
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There are several reasons for this unfortunate situa- 
tion. First, the fields are so new and the need so great, 
that not enough experienced and competent people have 
developed to handle this need. Secondly, many people 
who could most adequately fill the need are being more 
highly paid for their services by industry or by the 
government. Thirdly, competent researchers ordinarily 
have better facilities and equipment for work, which, 
in many industries and government, is identical to what 
he would be doing at a university. 

Curriculum 

A few selected examples of topics in computers, data 
processing, and related fields covered in the present 
curriculum of universities are: 

Use of electronic data processing 
Logical design of computers 
Computer electronics 

Electronic digital computers — digital computer cir- 
cuitry 

Theory of and operation of computing- machines 

Statistics in business forecasting 

Dynamic programming 

Numerical mathematical analysis 

Numerical mathematical analysis laboratory 

Matrix analysis 

Matrix analysis laboratory 

Numerical solution of differential equations 

Numerical solution of differential equations laboratory 

Principles of digital computers 

Programming for digital computers 

Business and industrial analysis 

Statistical methods — regression 

Probability models 

Linear programming 

Game theory 

Monte Carlo techniques 

Data processing 

Systems and analysis 

Information theory 

Switching and computing circuits 

Theory of coding 

Information storage and retrieval 

Documentation and classification. 

Such topics are covered not only in normal courses, 
but in special courses, seminars, symposia, and lectures. 

It may be seen that even this incomplete list covers 
aspects in the design and design models for computers, 
programming, and applications of interest to people in 
the business schools, engineering, the physical and bio- 
logical sciences, applied mathematics, statistics, indus- 
trial engineering, logic, etc. These courses reflect an 
appreciation of the ability of a computer to handle new 
computational models. They reflect an appreciation of 
the theoretical techniques of computer design. 

What is impressive is the lack of courses reflecting ap- 
preciation of the computer as an aid to routine mental 


effort, a theory of computers, a theory of programming, a 
theory of applications. This in turn is probably a reflec- 
tion of the youth of the fields. Such theories have not 
yet been born. When they are created and developed, 
courses will undoubtedly follow. 

Few, if any, of the university curricula are integrated. 
Even those that do have an aspect of integration — as 
the one at the Engineering School at the University of 
Pennsylvania, or the one at the Business School at 
U.C.L.A. — concentrate on the interests of people in 
particular disciplines interested in computers or data 
processing as tools for their own use exclusively. This 
is not to say that this is bad. But it is narrower than a 
university program might be. 

Research — Subject Matter 

The range and variety of potential research activity 
in the design, programming, and utilization of com- 
puters, and in closely related fields is enormous. 

Theses and research papers have been written on 
topics in computer logic, automatic coding, switching 
theory, coding theory, neurophysiology, inventory con- 
trol, production control, office automation, machine 
translation, organization, classification and retrieval of 
knowledge, and solutions of problems having computa- 
tional models in almost every quantitative discipline. 

Here, as in the curriculum, the fields of computer 
theory, application theory, model theory do not yet ap- 
pear to have been successfully attacked. 

Equipment Facilities 

Most of the equipment (see the Appendix) now pres- 
ent in universities in the United States has been selected 
on the basis of a single criterion, the cut-rate price, in 
spite of the fact that the selection committee would 
sometimes have preferred other equipment on the basis 
of technical and operational criteria. The effect of this 
on the quantity and quality of the research output can- 
not be measured. However, it is clear that the basis for 
selection of equipment ought to be the maximum benefit 
to the university community. It is difficult to see how 
any of the universities, now selling computing time to 
sponsored contracts (in order to pay the bill, to be sure) 
and having little time (usually the second shift) for 
legitimate academic research and instructional pursuits, 
will gain an outstanding reputation for solving univer- 
sity research problems with computational models. 
They will become known for providing bargains for 
service to sponsored (usually government) research. 
The main objection to this sort of thing is that the 
benefit to the university research staff derived from the 
activity is fortuitous and incidental rather than the 
planned, aggressive activity it ought to be and must be 
for maximum benefit to the proper university function. 

Some universities are still building their own com- 
puting equipment. There seems to be little reason to 
attempt this now, even at the present prices of com- 
mercially available computers. 
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Some Observations for a University Program 

Probably the most important reasons for the uni- 
versities’ inability to cope adequately with their new 
role in instruction and research in these fields, are their 
failure to identify clearly the role they want to play, and 
their failure to try to fit the new fields to their role. 
This section includes some observations that should be 
aids in the conscious selection by universities of their 
particular roles and of how to implement these selected 
roles in these new fields. 

The Function of the University 

The legitimate function of a university in any society 
has been the subject of ancient and continuing contro- 
versy. The controversy has revolved around the ques- 
tions loosely described as training vs education; prac- 
tical vs theoretical subject matter; routine vs non- 
routine activities; scholarly vs professional endeavor; 
and the various shades of grey in between. A. N. White- 
head stated and elaborated on one position at the dedi- 
cation of the Harvard Business School in 1927, 

This article will only deal with the most general principles, though 
the special problems of the various departments in any university 
are, of course, innumerable. But generalities require illustration, and 
for this purpose I choose the business school of a university. This 
choice is dictated by the fact that business schools represent one of 
the newer developments ... of university activity. . . . 

There is a certain novelty in the provision of such a school of 
training, on this scale of magnitude, in one of the few leading uni- 
versities of the world. It marks the culmination of a movement which 
for many years past has introduced analogous departments throughout 
American universities. This is a new fact in the university world; and 
it alone would justify some general reflections upon the purpose of a 
university education, and upon the proved importance of that purpose 
for the welfare of the social organism. 

The novelty of business schools must not be exaggerated. At no 
time have universities been restricted to pure abstract learning. . . . 
There is, however, this novelty: the curriculum suitable for a business 
school, and the various modes of activity of such a school, are still 
in the experimental stage. . . . 

These reflections upon the general functions of a university can 
be at once translated in terms of the particular functions of a business 
school. We need not flinch from the assertion that the main function 
of such a school is to produce men with a greater zest for business. 
. . . Business requires a sufficient conception of the role of applied 
science in modern society. It requires that discipline of character which 
can say “ yes” and “no” to other men, not by reason of blind obstinacy, 
but with firmness derived from a conscious evaluation of relevant alterna- 
tives. . . . 

Whitehead spoke well. If we substitute for the field of 
business, computers, data processing, and closely re- 
lated fields (let us call them the “computer sciences”), 
then universities may well select as their role: 

1) Training of professionals in these fields of “com- 
puter sciences” of interest and competence in the 
university. 

2) Training scholars in these fields. 

3) Doing exploratory research in these fields. 

4) Developing the subject fields into new disciplines. 

Disciplines and the Computer Sciences 

Many fields are now providing models for many other 
disciplines. Thus, work under the province of operations 


research, or game theory or decision theory or manage- 
ment science, or linear programming, or econometrics or 
statistics is being applied to business problems like 
market analysis, inventory control, long-range planning 
etc. In many instances, a computer is used to process 
data, solve equations, do analyses, even make decisions 
based on criteria it has been given. 

Switching theory, coding theory, information theory, 
Boolean algebra provide models not only for design and 
programming and applications of computers, but also 
of analogous fields like neurophysiology. 

The computer thus provides a significant link among 
various established disciplines as well as those fields of 
endeavor of intense present interest. Computers are re- 
lated to other fields in one or more of three ways: 

1) Workers in these fields use the computer as a 
mechanical or a mental aid. 

2) These fields provide models useful in the design, 
programming, and applications of computers. 

3) Analogies exist in the internal structure and or- 
ganization of a computer with structures and or- 
ganizations in other fields. 

It seems plausible to designate the fields mentioned 
above and those enumerated in the Introduction, as the 
“computer sciences” since they are related to each other 
in one or more of the three ways enumerated above. 

We must expect that some of these fields will coalesce 
and develop into disciplines on their own. These will 
then almost certainly be universally accepted as the 
legitimate province of the university scholar. Others 
may not turn out to be disciplines and will gradually be 
abandoned by universities. But not knowing now which 
field (s) will meet which fate, the university scholar must 
presently be interested in all of them. 

The term “discipline” has been tossed around loosely, 
so perhaps we had better try to define it. 

Although a set of agreed-to criteria do not exist by 
which one can determine whether or not a field rates as 
a discipline, there are several characteristics of accepted 
established “disciplines” that may be referred to when 
one has the problem, as we do, of deciding whether or 
not a given field is potentially a discipline. 

Established disciplines, like mathematics, have the 
following characteristics: 

1) The terminology has been established, a glossary 
of terms exists. 

2) Workers in the field do nonroutine intellectual 
work. 

3) The field has sometimes been axiomatized. 

4) The field is open, i.e., problems are self-regenerat- 
ing. 

5) There is an established body of literature, text- 
books, sometimes treatises — even handbooks — 
and professional journals. 

6) University courses, sometimes departments, and 
indeed schools are devoted to the field. 
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Most aspects of computers, data processing, and the 
related fields discussed in this study now meet these 
specifications or may be meeting them in the next ten 
years. It is clearly the job of university people to help 
create the axiomatization, theory, terminology, curricu- 
lum, etc. Computer science is not an isolated field. It is 
interdisciplinary. It is analogous to a library, or mathe- 
matics, where library science or mathematics are dis- 
ciplines in themselves as well as providing service tools 
to other disciplines. Thus, future university courses of 
instruction and research should be provided in the unit 
devoted to the disciplines of interest and in units using 
these disciplines as tools. 

The Computer 

Too much emphasis has been placed on the computer 
equipment in university programs that include fields 
in the computer sciences. Curriculum and research pro- 
grams have been designed as supplements to the com- 
puting equipment. The reverse should be the case. 
Computers should be supplements to a well-organized 
and integrated university program in the computer 
sciences! An important supplement, to be sure, but a 
supplement. Indeed, an excellent integrated program 
in some selected fields of the computer sciences should 
be possible without any computing equipment at all! 

Universities and Their Competition 

Universities must recognize and design around the 
fact that both industry and government have already 
undertaken and will continue to participate in the uni- 
versity’s traditional role of instruction and research in 
the computer sciences. This implies competition with 
government and industrytfor personnel, equipment, and 
financing. The university structures and policies built 
over the years to cope with traditional situations may 
not be adequate to cope with the present situation. 

A Recommended University Program 

The following is a detailed integrated program de- 
signed around the observations made in the previous 
section. 

Organization 

A graduate school, called the Graduate School of 
Computer Sciences, should be formally created. A policy 
committee, including a dean and his academic and 
service departments heads, should be appointed. Its 
primary function will be to set up and implement policy 
and budget on the function of the school, the curriculum 
for an integrated program of instruction and research, 
its relation to other schools and departments of the uni- 
versity, and other pertinent points, including financing. 
A “five-year plan” should be formulated. 

The dean of the school should report to the president. 
His responsible administrators will be scholars first and 
foremost. Their professional fields of interest will be in 
the fields of interest of the school, although they will 


have an appreciation and respect for the problems, 
philosophy, and ideals of scholars in fields other than 
those of their own interest. They will be enthusiastic, 
competent, and experienced. 

The school administrators will not be promoters. The 
problems of budget, financing, equipment support, and 
public relations — all of which normally require the 
talents and time of a promoter type — will, however, 
be assigned to professionals in these fields. They will 
report to the administration and will be observers in 
policy meetings. Theirs will be a service activity and not 
an academic one. 

None of the academic administrators will be required 
to show a profit on his activity. All activities with an 
objective of making money will be organized under the 
service activities associated with, but not an essential 
part of, the academic unit. 

The declared policy of the school concerning its fields 
of interest should be that it is interested in instruction 
and extension of knowledge in all fields that provide 
either quantitative models, or techniques for solutions 
of problems in quantitative models for phenomena of 
interest to the scholar. The theoretical and experimental 
fields enumerated earlier in this report are examples of 
these fields. 

Since it is recognized that some of these fields are dis- 
ciplines in themselves as well as tools for use by other 
fields, the Schools of Business, Industrial Engineering, 
Electrical Engineering, Mathematics, Physics, Psy- 
chology, etc., should be encouraged, when they find it 
desirable, to give courses and even to do research in 
fields normally covered by the new graduate school. 
But it is the responsibility of the new school to program 
an integrated activity for the whole university, 

The declared and announced policy of the school 
should be that its function is fourfold: 1) To train pro- 
fessional scientists; 2) to train scholars; 3) to do explora- 
tory research; and 4) to develop the new disciplines. 

These are only a few policies recommended for adop- 
tion, It cannot be emphasized too strongly, however, 
that the significant recommendations here are not so 
much in the detailed recommendations enumerated 
above, but in the recommendations for deliberate policy 
making and policy review of these matters. 

Departmental Structure 

The departmental structure of a new school must of 
necessity be arbitrary. One structure for consideration 
would consist of four departments as follows: 

Department 1 — Computer Department: This depart- 
ment would include groups concerned with computer 
equipment and service, faculty instruction in program- 
ming and computational models, model making, auto- 
matic programming, logic, computer organization, com- 
puter mathematics, computational models, computer 
theory, component and circuit research (hardware) if 
not covered in other departments, systems research 
(hardware) if not covered in other departments, etc. 
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Instruction and research in these fields will fall in 
this department. The computer equipment used in a 
service bureau or for instruction or research will be 
under the cognizance of the Computer Department 
head. 

Computation center: The purpose of the “computation 
and data-processing center” is to provide the tool for 
solutions of problems that have been cast into com- 
putational models by members of the university com- 
munity. 

The head of this center, who should be a scholar 
primarily, will be responsible for organizing the center 
operation for maximum efficiency. This will involve 
educating the general faculty, providing programming 
and operating assistance, promoting research and de- 
velopment of automatic programming and handling 
techniques, as well as assisting in the creation of com- 
putational models. By whatever means the equipment, 
facilities, and personnel are financed, the head of the 
computing center should not be obligated to show an 
operating profit. If profit is a prerequisite, then he 
should by all means go into business for himself. Func- 
tionally, the computer center should be organized into 
groups individually responsible for: 1) faculty educa- 
tion; 2) programming, coding, and operation assistance; 
3) model construction assistance; 4) research in auto- 
matic coding, handling, etc.; and 5) a planning, schedul- 
ing, and monitoring activity. 

The future demand for the products and services 
made available by a computing center depends, of 
course, upon the willingness and ability of the research 
faculty at the university to use them properly, as well as 
upon the utility and applicability of computational 
models in their work. It is true now that there is hardly 
a field in the physical, mathematical, biological, or social 
sciences that has not used computational models for re- 
search purposes. Future researchers in these fields will 
be using computational models more and more. A com- 
puting center will fill an important need on the univer- 
sity campus. It may be said, “To the extent that a re- 
searcher does not use such facilities when they are avail- 
able, applicable, and potentially useful, to this extent 
a researcher is not doing a competent job.” 

Department 2— Operations Research: This department 
will cover instructional and research activities in opera- 
tions research, linear programming, dynamic program- 
ming, game theory, queueing theory, and decision 
theory. 

Department 3 — Information and Communication : This 
department will cover instruction and research activities 
in information theory, switching theory, coding theory, 
automata theory, artificial intelligence, learning, lan- 
guage translation, and theory of simulation. 

Department 4 — Systems: This department will cover 
instruction and research activities in management 
science, econometrics, systems theory, information clas- 
sification, indexing and retrieval, model theory, self- 
organizing systems, and adaptive mechanisms. 


Curriculum 

The curriculum will contain courses, seminars, lec- 
tures, etc., that constitute an integrated program for 
students pursuing an advanced degree. What consti- 
tutes an integrated program will be determined as part 
of school policy. Special courses, seminars, or symposia 
may be given as the need arises. The curriculum will 
be expanded or modified in accordance with periodic 
scheduled evaluations by curriculum personnel. Ex- 
tensive special studies of what other universities and 
industry and government are doing will make appropri- 
ate information available to the curriculum evaluators. 

The curriculum will also reflect the needs of other 
departments and schools within the university. 

Faculty 

One of the reasons for recommending an independent 
entity such as a graduate school — rather than a depart- 
ment in an already existing school or interdepartmental 
committee — is to allow a freedom of choice in policy 
that can respond to the practicalities of the situations 
faced by the university today, without the constraints 
imposed by an existing structure designed to cope with 
situations that no longer exist. Thus, the department 
heads, if not others, may command perhaps $20,000 a 
year for their services in today’s competitive market. If 
this is indeed so, then the school should plan to put it- 
self in a position to obtain these people and to pay such 
prices. 

Each course, lecture, and symposium will be con- 
ducted by faculty members who are competent to teach 
the course, give the lecture, or conduct the symposium. 
The faculty will be interested, enthusiastic, and com- 
petent, and preferably experienced. They will be paid a 
salary commensurate with their experience and ability 
on a scale more in keeping with present industrial scales, 
not present university scales. 

No faculty member will be assigned to instruct be- 
cause his teaching schedule isn’t full, or because he is 
a competent researcher who “ought” to teach, or be- 
cause he is enthusiastic but inexperienced in the field. 

Research and Related Activities 

Research by graduate students or the faculty cannot 
be a planned affair, of course. One would expect to see 
research activity on selected topics of interest and 
within the capabilities of the school. 

However, there are related activities that may be 
arbitrarily categorized as research having to do with the 
development of new disciplines and the incorporation 
of information developed by research into the curricu- 
lum. A well-established discipline usually has a well- 
established terminology and a glossary, axiomatization, 
and text books, treatises, handbooks, journals, and 
organizations. Thus one should expect to see the re- 
search and instruction faculty engaged in establishing 
terminology, axiomatizing a field, writing or editing 
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books, journals, and other such material, and even 
helping to organize professional societies. 

Manual Aids 

Manual aids, such as desk computers, computresses, 
bookkeeping machines, punched card machines, and the 
like, have been extremely valuable to university scholars 
and students. The ability of modern high-speed equip- 
ment means that the quality and quantity of university 
output should be improved to the extent that this kind 
of manual aid is made available and used efficiently. 

It may be here noted that the quantity of the re- 
search output should increase not only because of the 
high speed and versatility of the electronic aid, but also 
because more of the problems, tasks, and the models 
created in the various disciplines become tractable with 
the use of the new aid. The quality should improve for a 
similar reason. There exist models of various phenomena 
in almost all disciplines whose problems have not been 
soluble with present techniques. 

In many cases simplifying assumptions are made and 
the model is revamped in order to make the problems 
soluble. In these cases, the wrong problems have been 
solved quite accurately. Present high-speed equipment 
makes possible the accurate solution of more of the 
right problems. 

The availability of high-speed computing systems 
and efficient operating crews has a much greater impact 
on the university output than, for instance, the availa- 
bility of batteries of desk calculators and operators. The 
curriculum, research program, and even faculty educa- 
tion are affected. Scholars and students will have to 
learn enough about the available devices to use them 
well with their present models. This knowledge should 
serve to motivate them to the creation of new machine- 
soluble models. In fact, a general theory of the construc- 
tion of techniques of machine-soluble model construc- 
tion for any discipline seems clearly to be a job for the 
university scholar. 

This is an unexpected but inescapable development 
growing out of the computer as a manual tool ! 

In addition, those techniques peculiar to the design 
and programming of high-speed equipment should be 
developed, if they can serve to make it a more efficient 
aid for manual routine work and especially if it results 
in an aid that could help handle more models. As a 
basis for this endeavor, there will undoubtedly be re- 
quired at least a competent theory of computing machines, 
a competent theory of models, and a competent theory relat- 
ing the two. 


Mental Aids 

Equipment for aid in routine mental tasks has fewer 
precedents than equipment as an aid for routine manual 
tasks. Computers are now being used to check results of 
some mental efforts. For example, they are being used 
to check the logical design of other computers. The abil- 
ity to use equipment as an aid to do routine mental 
work, and thus make it literally an adjunct of one’s self 
in creating new disciplines or developing existing ones, 
may turn out to be a discipline in itself. To be sure, this 
is rather vague, but this in no way weakens the convic- 
tion that it is a legitimate interest of the university. The 
university scholar must also be interested in the peculiar 
programming and design techniques of an equipment 
that will make the equipment most useful as an aid to 
routine mental work. 

Financing and Student Body 

Specific recommendations for sources of funds are 
not made here. The strong conviction exists that the 
present growing demand for this kind of activity by in- 
dustry and government is so great that students and 
money should be knocking on the door where such a 
program is available. 

Students will come from government, industry, and 
undergraduate schools. They will be interested in later 
applying their new knowledge to industrial or business 
pursuits; others will wish to be educated to become re- 
searchers and scholars in selected fields. How many of 
these aspirants attend the university will be a measure 
of the appeal, if not the distinctiveness, achieved by the 
university in these fields. 

Appendix 


Summary of Commercially Available Computers 
Installed at Universities Used for Univer- 
sity Research and Instruction 



Number 

Installed 

Estimated Number to Be 
Installed in 1959 

IBM 650 

65 

No estimate 

709 

1 

No estimate 

704 

2 

No estimate 

701 

1 

No estimate 

Sperry Rand 1101 

1 

5 large-scale machines 

1103 

2 

(models undetermined) 

1105 

2 


Univac 1 

4 


Burroughs 220 

1 

2 

205 

7 

6 

Bendix G-15 

12 

12-24 

Royal McBee LGP-30 

13 

20 

NCR 102D 

3 

— 
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The RCA 501 Assembly System 

H. BROMBERGf, T. M. HUREWITZf, and K. KOZARSKYf 


C URRENT techniques in automatic coding at- 
tempt to shift the user’s tasks away from the 
computer and closer to his application. Sacrific- 
ing coding details in this way, it is believed that monu- 
mental savings will result both in computer accepta- 
bility and utilization since everyone is now able to de- 
scribe his own programs. Universal acceptance of prob- 
lem oriented languages has, for several reasons, not yet 
followed. One influence is that the generated object 
programs reflect the adroitness of the executive routine 
and the remoteness of the input language. 

In a recent count, there appears to be well over one 
hundred automatic coding systems produced for twenty 
or more different computers. This reflects the recogni- 
tion of the disparity that exists between the methods of 
problem preparation and actual problem solution. 

By most methods of classification, these hundred-odd 
automatic codes range more or less continuously be- 
tween extremes. They vary considerably in complexity, 
extent of problem area of useful application, and in 
range of intended user. 

One categorization used to differentiate these auto- 
matic codes is by the sophistication of the input lan- 
guage — particularly whether this language is “problem 
oriented” or “machine oriented.” However, it must be 
admitted that if a ranking of these automatic codes is 
made according to efficiency of the object program, the 
list would tend to be in nearly inverse order to that ob- 
tained by ordering on the level of the input language. 
It should also be noted that evaluations of the academic 
aspects of these automatic codes are often greatly at 
variance with the judgments of the occasionally unfor- 
tunate users of these routines. 

This is not to say that object program efficiency is 
the only value criterion of an automatic system. Fre- 
quently for short programs or where the capacity of the 
data-processing equipment greatly exceeds the required 
performance, it is almost irrelevant. But, in instances 
where object program efficiency is significant, alterna- 
tive coding procedures are desirable. 

It is conceded that the Problem Oriented Language 
deservedly has greater prestige than the Machine Ori- 
ented Language and greater theoretical interest (at least 
from a philosophic or linguistic point of view). Never- 
theless, the current mechanization of these languages 
and the distribution of computer expenses dictate de- 
mands for both types. It is recognized that direct, facile 
communication between the layman and his computer 
as well as the advantage of interhuman communication 
of the problem definition are obtainable from a Problem 

f RCA, Camden, N. J. 


Oriented Language. However, there are also needs for 
programs handling tasks near the limits of the equip- 
ments’ capabilities as well as for infrequently changing, 
very highly repetitive, data-processing routines. 

One of the often expressed goals in automatic coding 
is the development of complete problem oriented lan- 
guages entirely independent of any computer. To pro- 
duce any “most efficient” coding in this circumstance 
means that, among other things, psychological infer- 
ences as to the intentions of the writer are to be made 
by the automatic code. Furthermore, the apparent 
trend in machine design toward many simultaneous 
asynchronous operations, multiprogramming and the 
like, increase the problems associated with producing 
efficient machine programs from a problem oriented lan- 
guage. It is hardly unreasonable for a user of the new 
potentially powerful systems to request a coding scheme 
capable of using these complex, expensive features. 

It appears likely then, in the near future at least, 
that some problem oriented languages will be aug- 
mented by some prosaic statements, directly or indirect- 
ly computer-related, which will permit attainment of a 
more “most efficient” machine code. Similarly, machine 
oriented languages may also yield to this trend and in- 
corporate some features, within their inherent limita- 
tions, which tend to be associated with problem lan- 
guage codes. 

All this may justifiably be construed as motivation for 
the automatic routines offered with the RCA 501. The 
first of these, a machine oriented automatic code, the 
RCA 501 Automatic Assembly System, is described in 
this paper. 

The Assembly System provides for: relative address- 
ing of instructions and data; symbolic references for 
constants and data; macro-instructions and subrou- 
tines; variable addresses; and descriptor verbs. 

Some Machine Characteristics 

It is appropriate, as background for what follows, to 
describe briefly some of those features of the 501 Com- 
puter (Fig. 1) which have influenced the design of the 
Automatic Assembly System. (Incidentally, this com- 
puter has been in operation in Camden since April, 
1958.) 

The 501 Computer has a magnetic core storage with 
a capacity of 16,000 characters, which is increasable in 
steps of the same to a maximum of 262,000 characters. 
Each character, consisting of six information bits and 
one parity bit, is addressable, although four characters 
are retrieved in a single memory access. Binary address- 
ing of the memory is provided, requiring 18 bits or three 
characters per address. 
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Fig. 1 — The 501 computer. 
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The instruction complement consists of 49 two-ad- 
dress instructions (see Fig. 2). Each instruction con- 
sists of : 

One character for the operation symbol 

Three characters for the A address 

One character for the selection of address modifiers 

Three characters for the B address, 

for a total of eight characters or sixteen octal digits. 

There are several control registers, each of which 
stores 3 characters, or one address (see Fig. 3). The 
A and B registers are used to store the A and B ad- 
dresses of operands during their execution. The P or 
program register stores the memory address of the next 
instruction in sequence. The T register stores a memory 
address which is made use of by certain instructions 
which require three addresses during their execution. All 
of these registers are addressable and therefore directly 
accessible to the program. 

Seven address modifiers are available. Four of these 
are standard memory locations and three are the A, T, 
and P registers. Use of the P register as an address 
modifier permits the writing of self-relative machine 
coding which may be operated, without modification, 
in any part of the memory. 

The RCA 501 exploits certain control symbols (Fig. 4) 
in the data. The Start and End Message control sym- 
bols define a message on tape, and in the memory also 
act as control symbols for certain instructions. The Item 
Separator control symbol is not used as such on tape, 
but is used in the memory to control certain operations. 
These control symbols permit variable item lengths 
and variable message lengths both on tape and in the 
memory. The entire message may be variable, depend- 
ent upon the number and size of the individual items. 
The instruction complement includes both symbol con- 
trolled and address controlled operations. 

The 501 includes provision for simultaneous read- 
write, read-compute and write-compute. This is ac- 
complished by designating magnetic tape instructions 
as “potentially simultaneous” and establishing a pro- 
gram controlled gate between the normal and simul- 
taneous modes of operation. Thus, the programmer can 
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Fig. 4 — Control symbols. 


permit completely automatic switching of tape instruc- 
tions to the simultaneous mode or he may optionally 
bracket off portions of the program where such switch- 
ing is inconvenient. 

As for the 501 Assembly System, there are two pro- 
grammer-prepared inputs. To guide the Assembly Sys- 
tem in generating a running program from the pseudo- 
code, the user provides the system with a description of 
the data files which the program is designed to process. 
A portion of a data sheet is illustrated in Fig. 5. 

Certain auxiliary computations are performed on 
these data sheets and the results printed out for the 
programmer’s information — such as average message 
lengths, approximate tape passage time, and weighted 
average. 

Data Addressing 

Completely variable length data, on the one hand, 
yields economies in tape storage aiid effective file pas- 
sage time; on the other hand, it presents certain prob- 
lems with respect to symbolically addressing data items 
in the memory. These problems are handled in several 
different ways: 

1) Those items whose lengths are fixed relative to the 
beginning of the message may, of course, be direct- 
ly addressed by the data name designated in the 
data sheets. 

2) The variable length items of a message may be 
transferred to a working storage area of memory 
where space is allocated for the maximum possible 
size of each variable length item. A single pseudo- 
instruction performs this function. From this 
point on, the variable length item may be directly 
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Item No. 

Sub Item 

Abbreviation 

Description 



D 

No. Char. 

% Use 

Wtd. Avg. 


n 

1 


Date 

File Label 

X 

R 

1 

6 

6 




A 

Month 





2 

2 




B 

Day 





2 

2 




C 

Year 





2 

2 




Fig. 5 — The automatic code data sheet. 


Instr. No. 

Comments 

OP 

A 

Address 

B 

Address 

T 

Address 

C 

s 

G 

IF 

GO TO 

IF 

GO TO 

IF 

GO TO 















Fig. 6— The automatic code program sheet. 


Instr. No. 

Comments 

OP 

A 

Adress 

B 

Address 

T 

Address 

C 

S 

G 

IF 

GO TO 

IF 

GO TO 

IF 

GO TO 

PRO 1 


LRF 

POLCY 











(Macro-Instruction) 


POLCY 




EF 

PRO40 + 1 

ED 


■ 




SC 

DATE 

DATE 

“122558” 


+ 

PR023+5 

- 

PHI 15 

0 

N 

PRO 20 

“Extra Beneficiary” 

DEFK 

KBEN 












DA 

RATE 


WRATE 










ADV 


+V4 










Fig. 7 — Assembly pseudocode entries. 


addressed by the data name preceded with a W 
(representing working storage). 

3) It is possible to locate the address of any item in a 
message by using an instruction which scans a 
message searching for and counting the control 
symbols defining items. This instruction leaves 
the address of the item in an address modifier. 

The second input, whose format is shown in Fig. 6, is 
the pseudocode written on the program sheets. ' 

This is seen to be an expansion of the machine code 
format which normally includes the operation field and 
the two addresses A and B. This is augmented on this 
sheet by the T or third address which for several ma- 
chine instructions requires presetting. Furthermore, 
there are provisions for 3 “IF-GO TO” statements pro- 
viding for conditional or unconditional transfers of con- 
trol. N 

The inclusion of these IF-GO TO statements as an 


optional part of every pseudoinstruction line has two 
primary motivations. First, it accommodates as a single 
pseudocode statement the function “Compare and 
Jump” which has a relatively high frequency in data- 
processing problems. Second, about f of the 501 instruc- 
tions automatically set a register to 1 of 3 states depend- 
ing on conditions encountered during the operation of 
the instructions. Branching instructions may then be 
used to select different paths depending on the setting 
of the 3 state register and are easily designated in the 
IF-GO TO columns. 

A single character entry in the CSG column generates 
an instruction to open or close the simultaneous gate, 
controlling the phasing of simultaneous tape operations. 

Variable Instruction Generation 

An interesting feature of the Assembler is the han- 
dling of the normal complement of 501 machine instruc- 
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tions. Each of these instructions, when used in the ex- 
panded pseudocode format, assumes the identity of a 
macro-instruction. Up to five, two address machine in- 
structions may be generated by employing a single 
machine operation code with appropriate entries along 
the pseudocode line. As an example, the 501 instruc- 
tion, Decimal Substract, may accomplish not only a 
three address subtraction, but also a simultaneous gate 
operation, and transfers of control dependent upon the 
sign of the difference. 

Symbolic and Relative Addressing 

Included in the automatic system are such features as 
mnemonic operation codes, symbolic and relative ad- 
dressing including the use of both alphanumeric and 
octal literals for constants, and acceptability of machine 
code should it be desired. Fig. 7 shows examples of pseu- 
docode entries. 

The Instruction number is composed of characters 
designating the page and line number of the instruc- 
tion. 

When addressing an instruction, reference is made to 
that instruction whose elements appear in the Opera- 
tion, A, and B fields of the program sheet. Since, how- 
ever, one single pseudoinstruction may account for as 
many as five machine instructions, it is desirable to 
address those. Therefore, a stipulated suffix to an in- 
struction number will allow a reference to any gener- 
ated instruction and to any field or desired character 
within one of these instructions. 

Relative addressing of these symbolics enables the 
programmer to refer to the Nth. pseudoinstruction fol- 
lowing or preceding a given symbolic. The program will 
be ordered by page number in alphabetic sequence and 
within pages by line number before any processing is 
undertaken. Accordingly, to accomplish an insertion, 
one need only assign appropriately sequential labels to 
the desired instructions and the program will place 
them in the proper positions. 

It is not necessary, however, to label every instruc- 
tion. Relative addressing allows reference to be made 
to unlabeled instructions in the program. One might 
usually expect to be labeled the first of a sequence of in- 
structions performing a logical function and those to 
which frequent reference is made by other instructions, 
but this is left solely to the discretion of the program- 
mer. 

Descriptor Verbs 

Descriptor verbs constitute an important part of the 
automatic code. These verbs contribute, in general, only 
to the description of the program and do not become a 
directly converted active part of the machine code. 

These special verbs perform a variety of functions 
such as the definition of program segments, overlaying 
memory regions, reserving areas of memory, extracting 
the machine address corresponding to any symbolic 
name, defining constants and variables and providing 
for insertions, deletions, and corrections in pseudocode. 


These verbs are executed during assembly and are 
deleted from the final program. 

Variable Addressing 

Another programming aid incorporated within the 
system is the variable address feature. A variable ad- 
dress allows the specification of addresses or constants 
to be symbolically named and to be defined later in the 
program. A variable may be substituted for any other 
machine or symbolic address in any instruction. This 
feature, for example, permits tagging, as a variable, the 
address of an instruction not yet written. It is only 
necessary then, at a subsequently convenient moment, 
to employ the Descriptor Verb, “Define V,” to supply 
the actual address of this variable for every place it was 
used. 

It is also possible to use variable addresses in addi- 
tion to any machine or symbolic address. This is ac- 
complished by placing the variable address in the same 
column as the one to which the variable is to be applied 
and in the directly succeeding line. A plus or minus pre- 
fix will then specify addition or subtraction of the vari- 
able address. A variable to be added or subtracted will 
not be applied until the variable is converted to an ac- 
tual machine address. The use of variable addresses, 
then, allows for symbolically designated modification of 
the program at the actual or machine code level. 

Literals 

Literals, or constants whose address and name are 
identical, are used in the assembler. Two types of liter- 
als are provided, alphanumeric literals for operations 
with data, and octal literals for operation with instruc- 
tions which, it has been noted, are binary coded. 

A literal is normally carried along with the segment in 
which it appears. However, a terminal character of the 
literal may be used to specify that the literal be stored in 
a common constant pool available to all segments of a 
program. A terminal character may also be used to 
designate and to differentiate among duplicated copies 
of the same literal in the program. Here, too, these 
duplicate literals may be associated with the segment 
or with the common pool of constants. 

Alternatively, of course, constants may be defined by 
a “Define Constant” Descriptor Verb and assigned an 
arbitrary symbolic address. Terminal characters on 
these constants perform the same functions with these 
constants as those just described. 

Macro-Instructions 

The macro-instructions included with the Assembler 
create 2 address symbolic coding which is spliced direct- 
ly into the main body of coding in place of the macro- 
instruction pseudocode call-line. A single macro-instruc- 
tion will generate all of the instructions required to per- 
form some task which would normally require the writ- 
ing of a sequence of machine instructions. 

Parameters, which the macro-instruction uses, are 
specified at the pseudocode call-line by the program- 
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mer. No restrictions exist as to number or size of these 
parameters. If a macro-instruction is to be generative, it 
contains one other part aside from the main body of 
stored coding. This part decides, from an interrogation 
of call-line parameters, which particular set of macro- 
instruction coding is to be included in the main routine. 

Subroutines 

The assembly system provides for an expandable li- 
brary of subroutines to be available to the programmer. 
These subroutines generate assembly language pseudo- 
code and as such may use all the assembly features such 
as macro-instructions, descriptor verbs, and so forth. 
Subroutines may be open or closed and generative or 
fixed. 

Parameters for subroutines are specified at the pseu- 
docode calling line. For open subroutines, parameters 
are incorporated during the operation of the assembler. 
These parameters may merely be substituted in the 
subroutine as in the case of a fixed routine or may be 
subject to considerable testing and manipulation as oc- 
curs with a generative subroutine. 

Closed subroutines may either incorporate param- 
eters during assembly or use parameters generated by 
the running machine program. In this case the parame- 
ters are located relative to the subroutine call-line. 

The design of the system is open to the extent that 
any useful number of macro-instructions and subrou- 
tines may be added. 

Provisions for Program Modification 

The Assembly System offers two main listings for 
program up-dating. First, listings are given of the object 
machine code and the Assembly language pseudocode. 
Second, is a list of all symbolic addresses and those in- 


structions referring to them. In addition, the Assembly 
System generates an information block preceding each 
object program. This block, which contains all program 
stops, breakpoint switches, and tape addresses is avail- 
able for input to a service routine which will modify any 
corresponding entries within the object program. 

There are two types of error indicators used by the 
Assembler. One causes the Assembly System to print 
the source of trouble and stop immediately. The other 
and major class consists of on-line printed statements in- 
dicating the type and location of errors. In this case the 
Assembly System continues its functions ignoring the 
“guilty” statements until all such indicators have been 
found. This permits the user to specify corrective meas- 
ures for all errors at one time. 

In summary then, the 501 Assembly System lies in 
an intermediate category. On the one hand, it is defi- 
nitely machine oriented, amplifying the 501 instruction 
complement and requiring a knowledge of the 501. 
However, it also provides for a flexibility of order state- 
ments, not confined to the 2 address machine order 
code. A variable number of machine instructions are 
generated dependent upon the number and types of 
entries made on each pseudocode line. Both macro- 
instructions and subroutines may be of the generative 
type and since the library is open-ended, may be aug- 
mented whenever necessary. 

In short, the RCA 501 Assembly System is a pro- 
grammer’s aide, enabling him to make maximum use 
of machine capabilities with a minimum of clerical effort. 
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A Program to Draw Multilevel Flow Charts 
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Introduction 

T HE preparation of a program for a digital com- 
puter is not complete when a list of instructions 
has been written. It still must be determined that 
the instructions do the required job, and if necessary the 
instructions must be changed until they do. Also a de- 
scription of the program should be written for others 
who may want to understand the program. A useful tool 
for the last purpose is a graphical outline of the pro- 
gram — a flow chart. 

f IBM Res. Center, Yorktown Heights, N. Y. 


Flow charts serve two important purposes: making a 
program clear to someone who wishes to know about it, 
and aiding the programmer himself to check that the 
program as written does the required job. A flow chart 
drawn by the programmer would serve for the first pur- 
pose, but drawing one is often a tedious job which may 
or may not be done well. For the second purpose, it is 
important to have the flow charts show accurately what 
the program does rather than what the programmer 
might expect it to do. Consequently, it was decided to 
write a program, the Flowcharter, for the IBM 704 
to produce flow charts automatically from a list of in- 
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structions. Another reason for the project was to get 
further insight into the characteristics of computer pro- 
grams. 

Since programs in many programming languages and 
even for different machines differ mainly in superficial 
aspects such as names and numbers for various opera- 
tions, names and types of registers, it was decided to 
have the Flowcharter do the main part of its work on a 
common, machine-independent language and to have a 
set of preprocessors, each of which would translate one 
programming language into this common internal 
language. 

We also felt it was desirable not to attempt to show 
the whole program as one chart, which for a moderate 
size program would either present a confusion of detail 
or be too general to serve the purpose. In order to pro- 
vide both a good general picture of the program or of 
any part of it, and a more detailed description of a 
smaller piece of program, the Flowcharter produces a 
series of flow charts on a number of levels of detail ; each 
part of a chart is shown in more detail on a succeeding 
chart. How to determine the makeup of the charts was 
one of the most difficult problems encountered in plan- 
ning the Flowcharter. 

Another feature of a flow chart is a description of 
the procedure represented by each box. The Flow- 
charter provides a summary of the machine input-out- 
put done in the box and a summary of the computation 
done in the box, listing the quantities computed and 
those used in the computation of each of them. 

Description of the Flowcharter 

The Flowcharter is composed of four main parts: the 
preprocessors, the flow analysis, the computation sum- 
mary, and the output program. 

The preprocessors each do a simple translation from 
the external instructions to the internal language. For 
most machines, an instruction may represent several dif- 
ferent processes done in the machine, such as fetching 
from memory, storing the memory, and instruction se- 
quencing. These operations are each described sepa- 
rately in the internal language. One external instruction 
is translated by the preprocessor into a suitable list of 
these operations. 

Many of the problems which arose in designing a 
Flowcharter were in the section which determines what 
is to be shown on each chart. It is very easy for the pro- 
grammer to mark off his program into logical parts, but 
to determine these from the program itself is quite dif- 
ficult in most programming languages. We have worked 
out a set of techniques which we feel will do quite well 
for most programs and will be acceptable in other cases. 
We have also provided facilities for the programmer to 
specify how he would like the breakdown done on vari- 
ous levels if he does not like the choices made by the 
Flowcharter. The techniques used depend mainly on 
analysis of the flow properties of the program but provi- 
sion is also made in the Flowcharter for using the data 
to help in the analysis. The Flowcharter is written in 


such a way that various techniques and combinations of 
techniques can be tested to see what results they give. 

This flow analysis is done by iteratively forming re- 
gions from groups of subregions. The smallest sub- 
regions are individual instructions. In general, each 
region will be represented by one flow chart and each 
box drawn on the chart will represent a subregion of 
that region. However, when it is reasonable, two or 
more regions, each consisting of only two or three sub- 
regions, will be shown on one flow chart. This is done 
to keep the output moderately compact. Also, those 
regions which are formed directly from instructions are 
not shown as flow charts but are given as a list of the 
instructions in the region, with a reference to the page 
on which this region is shown in context. (The Appendix 
shows an example of this.) 

The techniques used for region formation are of two 
kinds, combination and division. A combination tech- 
nique is one which starts with individual instructions 
and, by repeated applications, combines them into 
larger and larger regions. A division technique is one 
which starts with the whole program and divides it into 
smaller parts. Each of these parts is in turn divided 
until each part consists of not more than six or seven of 
the regions formed by the techniques of the first type. 
Each technique is represented by a subroutine. 

Each combination subroutine searches for a particular 
configuration of flow in the program. Three such sub- 
routines are: STRING, DIAMND, and TEST, which 
look for “strings,” “diamonds,” and “test sets.” 

A “string” (see Fig. 1) is an ordered set of regions 
satisfying the condition that every region, except the 
first, has an entry only from the preceding region and 
each, except the last, has an exit only to the next one. 

A “diamond” (see Fig. 2) is a set of regions containing 
a first region F, a last region L, and some intermediate 
blocks. Each intermediate block must not have any 
predecessor other than F nor any successor other than 
L. All successors of F and predecessors of L must be in 
the “diamond.” 

A “test set” is a set of regions which together make up 
a compound test. A set of regions forms a “test set” 
if each region ends with a test of the same special 
register. Also, every region except the first may have 
only one predecessor which must also be in the set. 
Finally, only the special register tested may be changed 
by the instructions in any of the regions except, pos- 
sibly, the first one. For example, consider the 704 SAP 
instructions : 


CLA 

ALPHA 

TZE 

ISZERO 

SUB 

ONE 

TZE 

ISONE 

SUB 

ONE 

TZE 

ISTWO 

SUB 

ONE 

TZE 

ISTHRE 
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(a) (b) (c) 

Fig. 1 — In each case the dotted lines enclose a “string.” (Circles 
represent regions formed earlier and solid lines represent paths 
of flow in the direction of the arrow.) 



Fig. 2 — In each case the dotted lines enclose a “diamond.” (Circles 
represent regions formed earlier and solid lines represent paths 
of flow in the direction of the arrow.) 


The pair CLA, TZE, and each pair SUB, TZE make 
up a region found by STRING; then these four regions 
"will be combined by TEST. 

It should be pointed out that the first two configura- 
tions, “strings” and “diamonds,” are sufficient to de- 
scribe most programs. Iterative loops do not have to be 
taken care of separately; when the program within the 
loop is combined into a region, the return path of the 
loop is also included in the region. For example, in Fig. 
1(a) and Fig. 2(d), the return path, P, although not a 
part of the string or diamond, is a link between the sub- 
regions forming the region and is therefore included in 
the flow chart of that region. The example used to show 
the output of the Flowcharter is a program for which 
only STRING and DIAMND are needed. 

The division subroutines attempt to discover par- 
ticular configurations “in the large.” Two such sub- 
routines are UNWRAP and SPLIT which look for loops 
and easily separable parts of the program. The division 
subroutines are not allowed to separate the regions al- 
ready built up by the combination routines. 

UNWRAP determines if the program is essentially 
one large loop; that is, it has an entry block E\, which 
has only one successor S, an exit block E 2 , which has 
only one predecessor P, and there is a path from P to S. 
In this case, the region representing the program is 
made up of three subregions: E\, E%, and the subregion 
including everything else. The last now becomes the 
“program” to be divided further. 


SPLIT looks for the situation where the program is 
composed of several essentially distinct parts, each of 
which has only one entry point and one exit point for 
paths to or from other parts. Each such part is one 
subregion and is divided further if necessary. 

At present, STRING, DIAMND, and TEST are 
used repeatedly until none of them can do any further 
combining. If there are no more than six regions left, 
these are combined to make the region representing the 
entire program. If there are more than six left, UN- 
WRAP and SPLIT are used repeatedly until they have 
either divided the entire program into the regions left 
by the combination routines or can not divide it any 
further. In the latter case, at present, arbitrary divisions 
are made until the program is so divided. 

This method should be adequate for most programs; 
however, the Flowcharter is written in such a way that 
routines can be added and other methods tried easily. 

In planning the computation analysis, the major 
problem encountered was that of determining when cells 
or registers were used only as temporary or erasable 
storage. In order to keep the amount of information 
down to a readable size, we wanted to list only the cells 
actively used in the region. We started with the idea of 
labeling a quantity computed but not used as “output,” 
and those computed and then used, “tentative out- 
puts” to indicate that they might be erasable cells. A 
“tentative output” was carried forward until an exit 
from the program or a use of the same' quantity was en- 
countered. If there was such a use, the “tentative out- 
put” became a real output — if not, it was considered 
erasable and would not appear further on the flow 
charts for that part of the program. Since a “tentative 
output” had to be carried forward on all possible paths 
but changed to a real output only on those paths on 
which a use was encountered, the bookkeeping necessary 
became unmanageable when the flow of the program 
was complicated. 

If the computation is traced backward rather than 
forward, the procedure becomes much simpler. If a 
quantity is needed at one point of a program, it must 
be available along every possible path backwards from 
that point until some point is encountered where the 
quantity is computed or until an entrance to the pro- 
gram is encountered. In the latter case, this quantity 
must be available at that entrance to the program. 
With each region shown on a flow chart, all the quanti- 
ties computed in that region are listed except those 
erasable cells which are used only within the region. 
For each quantity computed, there is given a list of 
quantities which are required at the entrances to the 
region and which enter into the computation of this 
item whether directly or indirectly. 

The last part of the Flowcharter arranges and prints 
the results of the other sections. The appearance of the 
final flow charts will be one of the most important fea- 
tures to anyone using the Flowcharter and will be as 
much like hand-drawn flow charts as possible. Each page 
will show one region composed of as many as six or seven 



134 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


smaller regions. Each of the boxes will indicate the 
entry and exit locations of the subregion it represents, 
the number of the page that has a detailed flow chart 
•of the subregion, the location of any exits and entrances 
in the middle of the region and the exit conditions for 
any exit. Each box will be outlined by asterisks and all 
transfer paths will be represented by lines to the appro- 
priate points. If the lines go outside the region, the name 
and page number of the instruction at the other end will 
be given at the top or bottom of the page for entrances 
or exits, respectively. The boxes themselves will be ar- 
ranged in two dimensions to show the flow in the region 
as clearly as possible, using horizontal as well as vertical 
displacements of the boxes. On the right-hand side of the 
page will be listed further information in three columns: 
the names and numbers of any input-output units used, 
and which memory cells were involved in each case; a 
list of those quantities for which values must be avail- 
able at the entrance to the region; and the computation 
summaries mentioned above. Provision is also made for 
reproducing comments, box titles, page titles, and other 
comments given by the programmer. (See the Appendix 
for an example of the output.) 


Status of the Program 

On February first, the program described here was 
nearly complete and checked out with a preprocessor 
for 704 SAP language. The parts not yet finished were 
UNWRAP, SPLIT, the drawing of the boxes and lines 
in the output program, and provision for some of the 
specifications by the programmer. In each case, much 
or all of the planning has been done. 

As soon as these are complete, it is planned to write 
705 and FORTRAN preprocessors and at least one re- 
gion forming subroutine which considers mainly the 
data used by the program, and has much less emphasis 
on the flow properties than the routines described here. 
Also planned is some experimentation with various 
methods of region formation. 
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Appendix 


The source program flow charted here is: 

C A PROGRAM TO MULTIPLY TWO MATRICES AND SUB STITUTE PLUS ZERO FOR 

C EACH ZERO ELEMENT, PLUS ONE FOR EACH POSITIVE ELEMENT, AND MINUS 
C ONE FOR EACH NEGATIVE ELEMENT. 


10 

READ 200 ((M(I, J), 1= 

= 1, 3), J = l, 4), ((N(J, K), J=l, 4), K = l, 5) 

20 

DO 1401=1, 3 


30 

DO 130 K = l, 5 


40 

L(I, K) = 0 


50 

DO 60 J=l, 4 


60 

L(I, K) =L(I, K)+M(I, 

J) * N(J, K) 

70 

IF (L(I, K) 120, 100, 80 


80 

L(I, K) = +l 


90 

GO TO 130 


100 

L(I, K) = 0 ' 


110 

GO TO 130 


120 

L(I, K) = — 1 


130 

CONTINUE 


140 

CONTINUE 


150 

PRINT 200 ((L(I, K), I 

= 1,3),K=1,5) 

160 

STOP 


200 

FORMAT (1514) 



EXPLANATION OF CONTENTS OF BOXES IN THE FLOW CHARTS 
PATHS INTO THIS SUBREGION 


V 


V 


V 




* FIRST LAST 

* LOCATION LOCATION 

* 

* 

* 

* 

* 

* 

* 


WHERE TO FIND MORE 
DETAILS ABOUT THIS 
SUBREGION 


EXIT CONDITIONS 


READING 

WRITING 


VALUES COMPUTATION 

REQUIRED DONE 


V V V 

PATHS OUT OF THIS SUBREGION 
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PAGE 1 

A PROGRAM TO MULTIPLY TWO MATRICES AND SUBSTITUTE PLUS ZERO FOR 
EACH ZERO ELEMENT, PLUS ONE FOR EACH POSITIVE ELEMENT, AND MINUS 
ONE FOR EACH NEGATIVE ELEMENT. 


ENTRANCE TO 



READING 

VALUES 

COMPUTATION 

PROGRAM 



WRITING 

REQUIRED 

DONE 

V 






*********** 

* * * 





* 10 

20 * 


READ CARDS • • • 


M(I, J) • • • CARDS 

* P.3 

* 


M(I, J) 


N(J, K) • • • CARDS 

* 

* 


N(J, K) 


I • • • +1 

* UNCOND 

* 





*********** 

* * * 





V V 






**:|c*:|i:|c**:|«:i::|s 

* * * 





* 30 

30 * 




K • • • +1 

* P.3 

* 





* 

* 





* UNCOND 

* 





*********** 

* * * 





V V 






*********** 

* * * 





* 40 

130 * 



I 

K • • • K 

* P.2 

* 



K 

+ 1 

* 

* 



M(I, J) 

L(I, K) • • • +0 

* K IS TO 5 

* 



N(J, K) 

-1 

* GREATER LESS 

= * 




+ 1 

*********** 

* * * 





V 







* * * 





* 140 

140 * 



I 

I ... I 

* P.3 

* 




+ 1 

* 

* 





* I IS TO 3 

* 






* GREATER LESS, = * 


V 

* 150 160 * PRINT • • • L(I, K) 

* P.3 * L(I, K) 

* * 

* STOP * 


V 

EXIT FROM 
PROGRAM 



> * 
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30 READING VALUES COMPUTATION 

P.l WRITING REQUIRED DONE 


* 

* 

* 

* 

* 

* 


* 

* 

* 

* 

* 

* 

* 


* 

* 

* 

* 

* 

* 

* 




* 80 90 * 

* P.3 * 

* UNCOND * 


************** 


* 

* 

* 

* 

* 

* 


* 

* 

* 

* 

* 

* 

* 


V V 

40 50 * 

P.3 * 

UNCOND * 


V V 

60 60 * 
P.3 * 

$]C 

J IS TO 4 * 


GREATER LESS, = * 


V 

70 70 * 

P.3 * 

$ 

L(I, K) IS TO 0 * 

GREATER = LESS * 


V 

100 110 * 
P.3 * 

UNCOND * 


V 


* 120 120 * 

* P.3 * 

* * 

* UNCOND * 




V V V 

130 130 * 

P.3 * 

* 

K IS TO 5 * 

GREATER LESS, = * 


I L(I, K) • • • +0 

K J---+1 


I 

J 

K 

M(I, J) 
N(J, K) 
L(I, K) 


L(I, K) 


J 

+1 


L(I, K) 
M(I, J) 
N(J, K) 


I 

K 

L(I, K) 


I L(I, K) • • • +1 

K 


I L(I, K) • • • +0 

K 


I L(I, K) 1 

K 


K K • • • K 

+1 


V 

140 

P.l 
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INSTRUCTIONS 

C A PROGRAM TO MULTIPLY TWO MATRICES AND SUBSTITUTE PLUS ZERO FOR 
C EACH ZERO ELEMENT, PLUS ONE FOR EACH POSITIVE ELEMENT, AND MINUS 
C ONE FOR EACH NEGATIVE ELEMENT. 

10 READ 200 ((M(I, J), 1=1, 3), J=l, 4), ((N(J, K), J=l, 4), K = l, 5) 

20 DO 140 1=1, 3 


FOR CONTEXT 
SEE PAGE 


1 


30 

DO 130 K= 1,5 

1 

40 

50 

L(I, K) = 0 

DO 60 J=l, 4 

2 

60 

L(I, K) = L(I, K) M(I, J) * N(J, K) 

(END OF DO AT 50) 

2 

70 

IF (L(I, K) 12, 10, 80 

2 

80 

90 

L(I, K) = +l 

GO TO 130 

2 

100 

110 

L(I, K) = 0 

GO TO 130 

2 

120 

L(I, K) = — 1 

2 

130 

CONTINUE 

(END OF DO AT 30) 

2 

140 

CONTINUE 

(END OF DO AT 20) 

1 


150 PRINT 200 (L(I, K), 1= 1, 3), K = 1, 5) 1 

160 STOP 


A Compiler Capable of Learning 

RICHARD F. ARNOLD f 


Introduction 

W E WOULD like to consider a new approach to 
the general problem of programming computers. 
To date, the methods of handling programming 
problems can be roughly classified into two families, 
each of which have certain characteristic advantages 
and disadvantages which seem to complement those of 
the other. 

The first group, developed from the subroutine philos- 
ophy, includes all interpretive schemes, as for example 
the "Bell Labs Interpretive System” for the IBM 650. 
The advantages of interpretive routines are that they 
are very versatile in the languages they can interpret 
and are comparatively easy to write. It is a fairly simple 
matter to write an interpretive routine to simulate an- 
other computer and thus achieve program compatibil- 
ity between different machines. The crippling drawback 
is the excessive time needed to execute routines inter- 

f Michigan State University, East Lansing, Mich. This research 
was supported in part by a grant from the Natl. Sci. Found, and 
taken from a thesis written under the direction of G. P. Weeg. 


pretively. Higher order interpretive schemes increase 
executions time exponentially. 

The second group consists of compilers and assembly 
programs. They are characterized by the fact that, un- 
like interpretive routines, they produce object programs 
which may be executed in reasonable amounts of time. 
Compilers, however, are difficult to write. "Fortran,” 
for example, took twenty-five man years to write. A 
second difficulty of compilers such as "Fortran,” is that 
although they are becoming more and more versatile, 
they still fail to express certain types of operations, and 
it has become necessary to make it possible to adapt the 
compiler so that the "Fortran” language may be tem- 
porarily left and programming done in a language closer 
to the initial machine language. Of course, this is a 
desirable feature for a compiler to have, but it does not 
solve the initial problem for which it was created, 
namely, to avoid machine languages completely. A fur- 
ther disadvantage is that as a compiler system becomes 
adapted for use on more than one computer, many of the 
"coding tricks” will have to be avoided. This may be de- 
sirable from the point of view of the compiler writer, but 
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optimum programs will not be written. This will be par- 
ticularly true as newer generations of computers are built 
which may have less similarity to present ones than we 
expect. 

Also, as new computers come into existence, it is 
likely that it will become increasingly difficult for even 
the trained programmer to use the machines efficiently. 
Only time can tell whether or not compilers and their 
improved offspring are the answer for the future. 

However, we would like to offer a different approach 
to the problem which, although it will probably not 
prove to be a better solution for the present, may indeed 
offer a much better line of attack for the future. Let us 
review first what is desired. We would like a scheme 
that would accept a program, either in the pseudo code 
of a compiler or in a different machine language, and 
would produce an object program for a given machine. 
We would like this program to take full advantage of 
all the special features of the machine and to be coded 
in a fashion such that running time and program length 
are at a minimum. We would also like to avoid too much 
work in adapting this scheme to a new machine. Since 
we may not know ourselves just how best to program a 
given machine, we would like it if the scheme itself could 
find the optimum procedures and apply them. Finally — 
and this is the one requirement which will be most diffi- 
cult to meet— the scheme must produce a program in a 
reasonable length of time and not require a tremendous 
amount of memory space. 

With the exception of the last requirement, we believe 
that the following scheme is capable of satisfying all 
these requirements, and that perhaps this one too may 
be met. The techniques used have almost all been sug- 
gested in other contexts. 1 Freidberg’s programs 2 also 
have some organizational resemblances to the compiler 
described. 

A Compiler Using a Random Program Generator 
The Compiler 

Let us consider a new kind of compiler. It is similar in 
function to other compilers in that it accepts a program 
in language A and produces an equivalent program in a 
language B. Since the requirement of our languages is 
that each order must specify exactly an operation, these 
languages define computers. Therefore, we may some- 
times speak of computers A and B. The compiler op- 
erates as follows. It first generates a program in language 
B at random, as will be described below. Although the 
program must be of specified length, it may be any con- 
ceivable combination of orders. It then proceeds to de- 
termine whether this candidate program is equivalent 
to the given program in language A. The method of de- 
termining the acceptability of the program involves the 

1 “Automata Studies,” Ann. Math. Studies, no. 34, pp. 215-277; 
1956. Note last 4 articles by Ashby, Mackay, and Uttley. 

2 R. M. Freidberg, “A learning machine: part I,” IBM J. Res . 
Dev., vol. 2, no. 1, pp. 2-16; January, 1958. 


use of two interpretive routines, A and B, which are ca- 
pable of executing the orders in the two languages. The 
subject program and candidate program are then both 
executed using identical data to ascertain whether the 
candidate program B is capable of producing identically 
the same output. If it is determined that they are equiv- 
alent, then this program will be punched out and the 
translation will be complete. If not, a new candidate 
program is produced and tested in a similar fashion. The 
process is repeated until an acceptable program is pro- 
duced. 

The Random Program Generator 

The random program generator will have the follow- 
ing characteristics. Provided with the length desired, 
the first order of the program will be chosen at random 
from all possible orders that might appear there, so that 
the probabilities associated with the choosing of all 
orders are identical. The second order will be chosen in 
the same manner, and then each successive order in the 
program, until it is complete. This becomes the candi- 
date program. This method could generate any conceiv- 
able program of a given number of orders. However, the 
probability that a particular program is generated is ex- 
actly the same for all. The program generator will utilize 
a random-number generating routine, and the range of 
the random numbers will be partitioned in such a way 
that by assigning equal intervals to each order and se- 
lecting that order which corresponds to the interval con- 
taining the random number, a program may be gen- 
erated in the desired fashion. It is recognized that the 
computer can generate only “pseudo” random numbers 
and that this will introduce difficulties. It is not too 
much to expect that the numbers be distributed rec- 
tangularly. Of more importance, however, is the se- 
quence that the generation of these numbers may take. 
It will be necessary that the random numbers not ap- 
pear in a sequence so that the corresponding programs 
do not contain certain sequences of orders. We must 
continue to watch for this possibility even if the com- 
piler works, since it may be producing only certain kinds 
of programs. 

The Acceptability Criterion 

The Interpretive Routine Simulation of Computers A 
and B : The simulation of the A computer will be straight- 
forward. All one really needs to know is what the output 
is for a given input. However, we shall provide the inter- 
pretive routine that executes the A program with one 
additional feature, an “address stop.” That is, the pro- 
grammer, or in this case perhaps some other part of 
the compiler, will be able to specify, before transferring 
control to the A interpreter, a particular location which, 
if appearing in the control counter will cause the routine 
to jump to some previously designated location outside 
the interpretive routine itself. Any order which would 
otherwise cause the computer to stop would be treated 
in a similar fashion. 
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The B computer will be simulated in a like manner 
with the address stop feature. The B interpreter will 
also be equipped with a “clock” that will keep track of 
the running time of the program it is executing as if it 
were being executed directly by a real computer B. 
Thus, it will be possible to discriminate between two 
acceptable programs which have different running 
times. 

Also, by placing a limit on the length of time a pro- 
gram may run on interpreter B, it will have a means of 
discovering and stopping endless looping. The “clock” 
would be checked before each order was executed to see 
if the maximum time permitted had been exceeded. 

Our final specification will be that these interpretive 
routines be in some standard form as it is hoped that to 
adapt the compiler to any other two languages, it will 
only be necessary to put in the interpretive routines for 
those languages and to provide the random program 
generator with a list of the orders comprising the object 
language. 

The Use of Randomly Generated Data in Acceptability 
Tests: A first definition of equivalent programs only re- 
quired identical outputs given identical inputs. It would 
be possible of course to require that for two programs to 
be considered equivalent, they must use the same algo- 
rithm. However, given a program, it is not possible to 
recover the algorithm used, and the only alternatives 
are either that the routine itself be treated as the algo- 
rithm or that we consider only results. Acceptance is 
therefore determined by a statistical criterion and the 
probability of correct answers on a production run may 
be made arbitrarily close to 1 by increasing the number 
of randomly selected sets of input data for which the 
candidate program must successfully compute the cor- 
rect answers. It will, however, only be necessary to use 
one set of data until a candidate program succeeds in 
producing the correct output for that one. It will also be 
necessary that the range of the data variables be speci- 
fied along with the subject routine. 

The procedure will be to examine the output and com- 
pare it with that of computer A at the completion of the 
running of a program. If the output is different, the 
candidate program will be rejected as it would also be if 
a specified time on the “clock” were exceeded or if the 
computer “hung up.” If the output is found to be identi- 
cal to that of computer A, then a new set of data would 
be generated and run through computer A with the sub- 
ject program to determine the correct output; and com- 
puter B would be given the same data and the candidate 
program executed again. When an acceptable program 
is found, it may either be punched out on cards or tape 
or else retained and a search made for a better program 
in terms of number of words and running time. 

We would be fortunate indeed if the compiler, as it 
has been described, could produce a program of mod- 
erate size in the life of the machine, much less within the 
hour or so maximum that might be allowable. Neverthe- 
less, aside from this difficulty, this compiler would do 


everything else that was desired. If the compiler were 
allowed to run long enough so that it could choose 
among the best of many acceptable programs, it would 
not only tailor the program to the machine involved but 
also in fact find many new “coding tricks” that a pro- 
grammer might never stumble upon. How then can the 
expected time involved in a translation be cut down? 
Two methods will be discussed. The possibility of break- 
ing the subject program into parts and translating one 
part at a time will first be considered. Then we shall 
consider how the program generating part of the com- 
piler may be replaced by a unit which will gradually 
modify the probabilities associated with the generation 
of programs in a manner such that they will produce ac- 
ceptable programs more frequently. 

Sectioning the Subject Program 

Criteria for Sectioning: If it is desired to break the 
subject program into sections, it will be necessary that 
for a given section there be a criterion for the accepta- 
bility of that section. Also, it would be required that this 
criterion be such that if all the sections were correct, the 
entire program would also be correct. If a method of 
sectioning can be found and these requirements met, 
the compiler could then handle one section at a time in 
the same way it previously handled the whole program, 
and such a procedure might be much quicker. 

The acceptance or rejection of the program described 
in the previous section was based on the fact that both 
computers had input and output devices and, further- 
more, that these input and output devices were enough 
alike in the form in which they handled the data to make 
comparison easy. However, when considering a section 
of a program, we can no longer define correctness in 
these terms because it is unlikely that input and output 
operations even occur in that section. The input and 
output devices were treated as equivalent parts of the 
computer. However, other parts may also be treated as 
equivalent, and this will make possible the development 
of adequate criteria for the testing of sections. Since it 
will be the procedure to test sections by executing them, 
it will be necessary to specify a set of states, one of 
which computer B should be in at the termination of the 
execution of a section. Since this set must be determined 
by the state of computer A at the same stage, there 
must be a one-to-many mapping of the states of com- 
puter A onto B. This is obtained by defining equivalent 
parts. One natural equivalence is that the contents of 
the two memories should be in some sense analogous. It 
might be said that the state of A is equivalent to the 
state of B if their memories contain identical informa- 
tion. If it were required that every part of computer A 
have an equivalent part in B, then the normal opera- 
tions of B would have to be abandoned and B made to 
mimic A’s every move. This indeed would be undesira- 
ble since, for example, if it were required that B be in an 
equivalent state to A after every order, then even if it 
were a decimal machine, B would have to find a way in 
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which it could do operations such as forming logical 
products, which are extremely awkward in any system 
other than a binary one. Therefore, equivalent parts 
will only be defined when the equivalence is natural. 
Initially, this will mean that the memories, control 
counters, and input and output devices must be made 
to correspond. If only equivalent parts may be ex- 
amined, it is necessary that only those parts of the com- 
puter may contain information relevant to the program. 
This then requires that our sections be constructed so 
that if the state of all other parts of computer A were 
altered at the time when tests for equivalent content 
were being made, there would be no interference with 
the correct operation of the program. This requirement 
will be used in sectioning the subject program. The sub- 
ject program will be “cut” between those orders when 
the state of all nonequivalent parts of computer A may 
be altered without affecting the running of the program. 
The procedure will be that all possible “cuts” be made, 
the states of those nonequivalent parts altered on the 
basis of a random number, and the program continued. 
Those cases where no errors are introduced will then be 
recorded and the program divided at those points. 

The comparison of the inputs and outputs, if any, 
and the control counters is fairly straightforward. The 
addresses specified in the control counter would have 
to be compared by ascertaining whether or not the sec- 
tions referred to by the two addresses are equivalent. 

The comparison of the two memories is more difficult. 
Regarding memory itself, it is clear that locations which 
have in them numbers that are computed functions of 
the data should be compared for equality. Orders which 
ha.ve been modified by a section must somehow be ex- 
amined to see if they will perform correctly in their modi- 
fied form. Also, the contents of other locations may have 
information which depends for its form on the order 
code of the particular computer, for example, “binary 
switches” and “dummy” orders. It is necessary then to 
establish the correctness of such locations by linking 
them with other sections. Thus we can conclude that if 
these sections are acceptably executed throughout the 
entire program, then the location in question must be 
acceptable. A similar procedure is used for sections 
whose function is to modify instructions. For each sec- 
tion this entails making a list of sections whose ac- 
ceptance must come prior to the acceptance of that sec- 
tion. In some cases it may be necessary to work back- 
wards from sections which only operate on data and 
may be checked immediately through several different 
sections, the acceptance of each one being a necessary 
requisite for the acceptance of a prior one. 

Admittedly, the above discussion is not complete and 
there may be situations arising which we have not con- 
sidered. Nevertheless, it is felt that the method itself is 
powerful and can probably be adapted to new diffi- 
culties as they arise. 

Expected Running Times: The purpose of sectioning 
the program is, of course, that it is desirable not to 


throw out a program completely when only a part of it 
is incorrect. However, though it might seem obvious 
that the expected running time would be reduced, this 
does not follow just because the program is being com- 
piled in sections, as will be seen. 

Consider first the probability that an entire program 
generated by the random program generator is exactly 
a given program. If the program contains l orders, each 
of which might be any of k different orders, this proba- 
bility will be l/k l . The expected number of trials until 
it is constructed will be k l trials. Now, if the program is 
sectioned into g parts of lengths mi, m 2 , • • • , m a , then 
the probability of a particular candidate section being 
a particular one is the expected number of trials 

is k mi , and the expected number of trials for the entire 
program is 

a 

X) k mi . 

*= l 

However, this is not the situation. In fact, there are 
many acceptable programs. If there are c acceptable 
programs of length /, then the expected number of trials 
to hit one of them will be k l /c, and if the program is di- 
vided into g sections with lengths mi, m 2 , ■ • • , m g and 
there are C{ acceptable ways of writing the Ah section, 
then the expected number of trials to translate the en- 
tire program will be 

0 

Z , , 

If 

0 

XI c i ~ c i 

i=l 

then it could be concluded that the expected number of 
trials for the procedure using sectioning is much less 
than the other one provided the minimum expected 
number of runs for any section is greater than l/g ff_1 . 
For g>l, 1 <(l/g e_1 ) <2 and the expected number of 
runs for any section will never become that low. How- 
ever, by requiring the program to achieve certain cri- 
teria at many points during its execution, many pro- 
grams that would satisfy the criteria of identical out- 
puts will be rejected. Therefore, in general 

n o < c. 

i=l 

There is no a priori method then of saying conclusively 
that sectioning will lower the expected number of runs, 
although it seems that it would except in unusual cir- 
cumstances. However, it might at some point be possi- 
ble to pool some of the sections and reduce expected 
translation time, while at the same time increasing the 
probability of producing a better program because of 
the fewer restrictions. 
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The Adaption of Learning Models to 
the Compiler 

The Alteration of the Random Program Generator to Per- 
mit Learning 

So far, the random program generator has selected 
each of the k possible alternatives at each step with 
equal probabilities. It is entirely possible to allow the 
orders to be selected with other probabilities. Given a 
set { P } of probability vectors P = (pi, p 2 , • • • , pk) 
such that 

h 

= i 

i=»l 

where p x is to be associated with order Oi, the selection 
of an order may be made in the following manner. Given 
a pseudo random number R such that 0<i?<l, and a 
vector P, then Oj will be selected when 

?—i 3 

X Pi < R < Z) 

1=1 j—l 

If ft. has a rectangular distribution as it should have, 
then Oj will be selected with probability pj. A specific 
selection of an order will constitute a response, and the 
set of probability numbers (pi, p 2 , • ■ • , pk) will be 
abbreviated P. 

Intuitively, it is felt that some P’s will be more satis- 
factory than others, and a method must be found to ar- 
rive at some “best” one. First, some measure of per- 
formance is needed. The measure might include not 
only how often a particular P produces an acceptable 
program, but also whether the program is concise and 
has a short running time. However, it will be more con- 
venient at present only to consider a U(P)=Pr. (A 
candidate program using P is acceptable.) It would then 
be desirable to find the maximum of this function where 
P ranges over the set { P } and use the corresponding P. 
At present there is no information about this function. 
A guess might be made that it is continuous but has 
several modes. It will be assumed, however, that it has 
but one, the hope being that if any of them are discov- 
ered, a fairly satisfactory selection of orders will ensue. 

One method that might be used would be to modify 
the pi's whenever an acceptable program is achieved, 
or if sections are being dealt with, an acceptable section. 
Given that a particular order O t were in the acceptable 
section, the current set of pi’s would be replaced by 




/ , a - p.)\ / 


where d is a parameter that would reflect the magnitude 
of the desired change. This procedure would be repeated 
for each order in the acceptable section. The expected 


value of P can be shown to approach in expectation the 
value at which U(P) has its mode. It is desirable to let 
d be a function of the number of modifications already 
made, so that as a modal value is approached, the vari- 
ance about it will be decreased. 

This procedure might be called learning because it 
permits an increment in performance to occur as a re- 
sult of the “experience” of the compiler. 

The Stimulus Situation 

Consider now the situation of the response selector at 
a given point in its selection of orders for a candidate 
section. Previously, the P’s used for the selection of 
each order in the program have been identical. How- 
ever, if P y is the set of probability vectors that might 
be associated with the selection of a particular response 
y, it would seem that max U(P V ) will occur at different 
values of P for different values of y. In order to devise a 
method for obtaining these different values of P, there 
must first be a method of classifying the y’s. It may 
seem at first that y could be classified according to its 
location alone in the program, but this would not help a 
great deal. This seems to have been one of the difficulties 
of Freidberg’s programs. It is clear that most of the de- 
viation of P y from P may be explained in terms of y’s 
relation to the orders around it in the program. For ex- 
ample, P y should certainly depend on the selection that 
has already been made for (y— 1). Since max P y is con- 
ditional upon the value of (y — 1) , a set of k P’s may 
be used, one for each value of (y— 1). In general, not 
only will max P be dependent orL (y— 1) but also on 
(y — 2), (y — 3), and in fact on many other variables that 
might not even be defined in the candidate program. 

Other variables that should be considered are the 
particular orders that make up the subject section be- 
ing translated and the contents of certain locations in 
both the interpretive routines used. While the relation- 
ship of the subject orders might be obvious, that of the 
interpretive routines is probably not. The interpretive 
routines do contain variables in the sense that (y — 1) is 
a variable, for example, the “left-right counter” neces- 
sary in an interpretive routine simulating Princeton- 
type machines having two orders stored in one location 
in memory. In fact, it might be suspected that since a 
human being can know all there is to know of a com- 
puter’s importance to programming by examining an 
interpretive routine simulating it, indeed a great deal 
might be gained by considering the relationship be- 
tween max Py and some of these variables. The reason 
for the concern with the dependence of all these varia- 
bles with max P y is that prior to selecting the value of 
y, the particular values that these other variables have 
taken will be known ; and if the relationship is also recog- 
nized, a better P y may be used. Certain definitions fol- 
low naturally. If X = (xi, x 2 , • • • , x n ) is a set of variables 
whose values may be determined prior to the selection 
of order y and S= (xu, x 2 i, ■ • • , x n i), a set of particular 
values for X, then max P y . s will mean that value of 
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(Pj for which U(P y . s ) =P r (a response chosen using 
P y . s is acceptable given X — S) is maximized. S is called 
the stimulus vector, and Xi, x 2 , • • • , x„, stimulus vari- 
ables. 

The Selection of the Response 

Considering again the selection of the value of y, 
it is desired to find max U(P y . s ). If the best P y . s were 
determined independently for each possible and if 
d = 1, that is, if p c becomes 1 when c is the correct re- 
sponse for the specified S, and if all possible stimulus 
variables were included, the scheme would have these 
characteristics. It would classify completely all the cor- 
rect responses corresponding to the given stimulus 
vectors. But the same situation, that is, S, would never 
be encountered twice unless the scheme were again 
asked to translate the same program. Therefore, it 
would never be any better than a scheme which took 
no cognizance of the stimulus situation. Of course, this 
method is impossible because of the space requirements. 
Restricting the number of stimulus variables might be 
one way out. However, if one such variable could take 
on the average 40 values, the p’s expressed to five binary 
digits and &=40, the 512,000,000 words of 40 bits 
necessary to store the p’s for just five variables would 
be prohibitive. 

A Linear Modal 

Clearly the trouble lies at least partly in allowing the 
max Py. s to be freely determined for every S. While it is 
true that in general the effect of one stimulus variable 
xj on max P y . s will depend upon the values of the other 
stimulus variables, it is not necessarily the case that the 
effect of Xj on max P y . s will change for every change in 
the value of any stimulus variable. 

A procedure that would take advantage of this fact 
might be developed along the lines of linear regression 
theory. The modal assumes that max P y . s is a linear 
combination of max Pj,.*.. where Pyx-.f represents the 
fth probability vector associated with the stimulus vari- 
able Xj. Given then an5 = (xh, Xu, • • • , x n j), the value 
P y . s to be used in the selection of y would be 

Py. s = Ji(max Pyxu) 

+ 6 2 (max P y .x 2i ), • • • , 6»(max -Py ■*»<)• 

The max P y . x .fs would be estimated in the standard 
way; P y . x pi would be modified to increase p e when O c 
was an acceptable order occurring at y when Xj had 
value i. 

hj should be increased as all the individual probabili- 
ties within {Py. Xj } tend away from max P (max P taken 
in the sense of the last section). This could be accom- 
plished by letting hj increase in increments according to 
a sample taken from a random variable highly corre- 
lated with 

k 

( Pyxj.i. h Ph ) 2 

i—X 


where Py Xj .i. h is the ht\x component of max P y . x H and Ph is 
the hth. component of max P. Likewise, hj should be de- 
creased as Xj shows increasing correlation with the other 
variables of X. 

It is suggested that a measure of how much hj should 
be lowered because of its correlation with other varia- 
bles might be some function of the similarity of the par- 
ticular Pyxj.i to the other n — 1 vectors determined by 
the particular S. 

A method for the selection and rejection of stimulus 
variables would follow naturally. At periodic intervals 
that Xj whose hj were lowest would be discarded, and a 
new one selected at random from the remaining varia- 
bles not currently represented in the stimulus variable 
set. 

The model may be made even more flexible by per- 
mitting variables which are the products of two or more 
stimulus variables. 

A dubious future would seem to be ahead for the 
model described above if required directly to “learn” 
how to translate programs for an 1 103 A to 704 language. 
The unfortunate situation is, however, that almost 
nothing is known of the joint distributions of what have 
been called the stimulus and response variables in such 
a translating endeavor, and therefore our judgment of 
any model, prior to its incorporation in a translator pro- 
gram, must be intuitive. 

An Example 

For the purpose of exhibiting some of the general 
characteristics of the type of translator proposed, a 
program has been written that does in fact compile com- 
plete programs for an imaginary one-address machine 
given those for an imaginary three-address machine. To 
be sure the two machines used are neither of the size nor 
complexity of current machines, nor are they as dif- 
ferent from each other as current machines. Neverthe- 
less they are sufficiently powerful to be able to compute 
transcendental functions, invert small matrices, etc. 

Our experience with this compiler is limited and a 
full report on its performance is not yet ready; however, 
our results to date have proved both educational and 
encouraging. 

Our estimations of the initial expected time of trans- 
lation was of the order of one hour per instruction of the 
subject program. After gaining experience, the trans- 
lator should reduce this to about 90 seconds per subject 
instruction. In the first programs translated, the trans- 
later did considerably better than anticipated. Upon 
examining the programs produced, the reason became 
evident. We had overlooked a large class of acceptable 
programs, namely, those in which numbers were left in 
the arithmetic register of the one-address machine at 
the termination of a section; those same numbers could 
be of use in the succeeding section. The translator had 
promptly taken advantage of this. 

The principal difficulty encountered so far has come 
with conditional transfers of control. The trouble lies 
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in that the acceptance of a sequence containing a con- 
ditional transfer is contingent on its correct operation 
at several points in the program. Therefore, a large por- 
tion of the program must be executed before a candidate 
unit may be rejected or accepted. This implies that for 
very long programs the probability of success for a 
particular candidate unit containing a conditional 
transfer is going to have to be much higher than would 
be necessary for a purely arithmetic unit. This same dif- 
ficulty will arise in the situation of units whose function 
is to modify instructions. 

The translations produced have been far from opti- 
mal, primarily because the length of unit translated at 
one time was one subject order. If two at a time had 
been taken, the initial expected running times would 
have been much too great. However, the present trans- 
lator may now be modified to use its experience on one- 
at-a-time translation in translating two at a time, and 
the initial expected running times will be reasonable. 
This suggests that eventually the translator should 
choose the size of the unit it attempts to translate ac- 
cording to the subject orders involved. Thus the trans- 
lator would reduce the size of the unit attempted if it 


were recognized as one with too high an expected trans- 
lation time, and increase the size as it gained experience. 

Conclusion 

It was suggested at the beginning of this paper that 
the type of compiler described might not be of immedi- 
ate use. In the author’s opinion, this is not because our 
machines are too small and too slow, although certainly 
larger random access memories would be helpful. The 
cause seems to lie more in our ignorance of machine de- 
sign and programming, which amounts to essentially 
the same thing. Any increases in speed are quickly ab- 
sorbed in the realm of combinatorics. 

It is hoped that the ideas expressed here will have 
some value in finding the solution to this large class 
of problems, which includes not only compilers but 
human-language translation, game playing, and other 
problems where the initial complexities suggest a solu- 
tion that is self-improving. Development of informa- 
tion processing theory will certainly make these modi- 
fied British Museum techniques obsolete. Nevertheless, 
they may be of value in the interim and may be able to 
contribute to the more rapid development of that theory. 


Special-Purpose, Electronic Data Systems — The Solu- 
tion to Industrial and Commercial Automation 

WILLIAM V. CROWLEY f 


Introduction 

T HE concept that any “standard” electronic data 
processing or computing system should be adapted 
solely through programming to suit a particular 
business or industry is no longer supportable, except as 
an interim step for testing the planning and design of 
the data system through simulation. Nor does the 
answer to making “standard” electronic data systems 
more easily adaptable to various business applications 
lie in the area of so-called automatic programming. This 
is not to imply that this endeavor has not or will not be 
valuable and useful. 

Its main contribution, however, will be to force data 
definition discipline in business systems so that more 
powerful electronic data processing equipment, with 
more pertinent and functional instructions and com- 
mands, will be built. 

A careful study and comparison of the current data 
systems of many different types of business enterprises 

f Information Systems Dept., Ramo- Wooldridge Corp., Los 
Angeles, Calif. 


will show considerable similarity in the general struc- 
ture and media of the data systems, such as files, action 
documents, information documents, reports, etc., but 
will disclose numerous differences in the composition 
and arrangement of the data itself, as well as manage- 
ment’s philosophy of and concern with data handling. 
Some of these differences are degree of variability of the 
length of the data items; relative occurrence of alpha- 
betic and numeric information; language idioms or us- 
age peculiar to the business or industry; attitudes and 
practices concerning the coding of information ; govern- 
ment regulations; accounting customs, etc. Because of 
these existing variable factors and the social and eco- 
nomic resistance to absolute standardization, industrial 
and commercial data systems require equipment de- 
signed, not adapted, for their particular situation. 

Negative Influences 

Let us review some of the important factors which 
tend to inhibit this inevitable trend toward the building 
of electronic data systems tailored to a particular in- 
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dustry, type of business, public utility, or governmental 
organization. 

Perhaps the most important single deterrent to more 
rapid development and use of “specialized” general pur- 
pose electronic data systems is the lack of realization, 
understanding, and acceptance by top management 
officials of the ultimate advantages of a tailored, con- 
trolled data system in efficiency; compression of tradi- 
tional time cycles; provision of real cost reductions; and 
detection and elimination of errors. For example, most 
manufacturing executives are willing to invest in and 
work out financing for an expensive machine tool to im- 
prove performance and output. Many are still reluctant 
to invest the money and effort associated with imple- 
menting a large electronic data system. As the middle 
management of today moves into the top jobs the pace 
of electronicization of data flows will increase markedly. 

The second most influential inhibiting factor is that 
the business machine manufacturing industry is largely 
anchored to the past both philosophically and economi- 
cally. Conceptually, the idea of integrating the informa- 
tional aspects of business enterprises, and the design of 
techniques to accomplish this, did not come primarily 
from machine manufacturers, but from business systems 
analysts and imaginative managers who first understood 
the data flow requirements of business and then sought 
to learn of the capability of existing electronic machines. 
Many of the first electronic data systems were oversold 
through use of traditional marketing approaches. The 
state of the art of design and manufacture of electronic 
machines has advanced rapidly in the last four years. 
The art of applying the machines effectively has also 
advanced, but not as rapidly. Some commercial enter- 
prises such as insurance companies have advanced 
markedly in data systems understanding. Others, such 
as manufacturing concerns, have not progressed as 
much. 

Another important reason for a lag in advancing 
toward the use of special electronic data systems for 
general purpose use in a particular industry is the mat- 
ter of communication and agreement between the equip- 
ment designers and manufacturers; business systems 
analysts; equipment programmers and operators; and 
business operating and staff managements. The differing 
objectives of these groups and shortage of managers who 
understand; all these points of view have impeded the 
faster introduction and use of electronic data systems in 
business. The designers and manufacturing personnel 
are faced with keeping costs down, and as a result im- 
portant equipment functions may be left out. Similarly, 
a clever equipment design feature may contribute 
nothing but higher cost to the user. Competent business 
systems analysts who really understand the objectives 
of the enterprise and the data processing requirements 
are considered staff and are often not placed sufficiently 
high in the organization to enforce their improvements. 
Theirs is a continual job of “selling” both to top man- 
agement and operating management. The programming 


staff is always seeking to ease its function, and if a very 
precise statement of the problem is not presented, in- 
terpretations will be made which may weaken or change 
the desired end result. Top management wants quick 
results, whereas the relative efficiency of the existing 
system, and the specified objectives, and the inherently 
complicated planning and implementation procedure 
require a considerable investment in time, personnel, 
and money. Since a justifiably efficient EDP system 
implies functional integration, operating management 
may be reluctant to accept the inevitable re-centraliza- 
tion of authority. 

Many attempts are being made to reconcile these 
points of view through EDP equipment users’ organiza- 
tions, industry committees, conventions such as this 
one, and university and special courses which attempt 
to explain and present the various points of view ade- 
quately. 

Favorable Influences 

There are several distinguishable factors and de- 
velopments which promote and support the concept of 
“specialized” general purpose electronic data systems. 

Perhaps the most important positive indication of 
this trend is the existence and development of several 
such specialized electronic data systems and projects 
in different organizations. Some of the better known ex- 
amples are electronic equipment to maintain the status 
of airline reservations; equipment tailored to banking 
operations; equipment suitable primarily for the control 
of process industry operations; equipment especially de- 
signed for hospital operations; and government special 
systems for capturing, assimilating, and presenting de- 
fense warning information for field army operations, 
and data needed in U. S. Air Force logistics. 

Increasing costs of comparable standard systems 
and progress in the state of the art make it possible to ob- 
tain a specialized electronic data system at not too much 
greater cost than the ultimate cost of implementing a 
“standard” adapted system. Furthermore, the buyer 
is assured of a well operating system because of specially 
negotiated contractual terms. The increasing costs of 
standard systems are associated with increasing market- 
ing costs imposed by competition, the need for more 
highly trained people to harness the more powerful 
equipment, and higher development and manufacturing 
costs. 

Another significant indication of this trend is found 
in the actions of certain well-known accounting and 
business machine manufacturers to concentrate on a 
specific industrial or commercial business. 

Benefits of the Trend 

There are many advantages which will accrue from 
the extension of this predicted trend, toward the use of 
specialized general purpose electronic data systems. 

For the user this development will mean that the 
buyer and user are assured of getting a data system that 
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will work. Because of individually negotiated contracts, 
there will be no vague paragraphs in contracts about 
training of personnel, library and subroutines to be 
furnished, assistance in system analysis and program- 
ming, availability of various techniques of adaptation, 
and the attempt to force the user’s system into an arti- 
ficial equipment usage schedule which is not related to 
the specific situation. Under present terms the user will 
get a more or less good job done depending on the 
strength or weakness of the local equipment manu- 
facturers’ office. 

For the equipment manufacturers the most advantage 
will accrue to those who accept this trend first. It will 
also mean some changes in marketing and operating 
methods. The highly skilled sales team approach to sell- 
ing will be required. Thus many manufacturers will be 
placed at a temporary disadvantage depending on their 
present situation. Ultimately, manufacturers will bene- 
fit in that, although they may have to apply more sell- 
ing effort and customer assistance, there is a greater 
chance that they will be paid more fairly and adequately 
for their efforts. The relationship of manufacturers and 
buyers will be clarified, and there will be less chance for 
dissatisfied users — a condition which is harmful to the 
broad concept of electronic data systems and the entire 
industry. 

For the state of the art, advancement will accelerate 
because, with the closer user-manufacturer relation- 
ship the buyer will be willing to pay for advanced equip- 
ment built especially for him. 

For personnel trained and experienced in electronic 
data system techniques and an understanding of the 
business data requirements of the various commercial, 
industrial, and governmental enterprises, there will be 
expanded opportunities for employment and innovative 
work. 

For the economy as a whole, we can look for con- 
tinued increased productivity per individual, and the 
ultimate, virtual elimination of the wasteful errors of 
carelessness, misinterpretations and the varying appli- 
cation of logical rules which plague the modern business 
enterprise in its daily operations. 

Research and Development Required 

Before generalized special electronic data systems 
reach the anticipated rate of installation more pure and 
developmental research must be made in several areas. 

The first field of development has to do with further 
scientific investigation into the information require- 
ments of the various levels of management in the sev- 
eral types of business enterprises. In many industries 
the creation, maintenance, and dissemination of data 
has become an end in itself. De-integration and com- 
partmentalization of larger business organizations has 
given custody and control of certain portions of vital 
operational data to superficial or minor organizational 
units which are their only source of power. Often these 
data are in the form of reference files, where the chief 


file clerk is a force to be reckoned with in the existing 
administrative mechanism. Other artificial administra- 
tive data terminals may be illustrated by a coding sec- 
tion where data are coded for facilitation of processing 
and summarization. Still another artificial administra- 
tive data terminal is found in the specialty of cost 
estimating. Usually the data required to estimate any 
particular cost reside in or are generated by: 

1) Vendors’ files outside the company 

2) The engineering section 

3) Material control and purchasing section 

4) Production planning section 

5) Accounting department records 

6) Payroll department 

7) Written or verbal policies of profit margins and 
markups. 

These data are also required for many other purposes. 
Why then should not one central company file of neces- 
sary business data be maintained for presentation to 
the functional area officials as required? Under this con- 
cept management can truly manage, by exercising direct 
control over that vital body of company operational 
intelligence. 

A second area of investigation should be into the 
nature of various functional areas themselves. For ex- 
ample, such obvious questions as: What is production 
control? What information is used in making production 
control decisions? Where does it originate? Where does 
it go? How is it used? etc. For every type of business a 
study must be made by function, noting any peculi- 
arities. of a particular function in the specific business. 
The peculiarities themselves must then be analyzed to 
determine the significance of the differences. After logi- 
cal conclusions are reached a foresigh ted, enlightened 
management is required to implement the findings. 

These investigations can best be performed by com- 
petent independent organizations such as the business 
schools, research organizations, consultants, and in- 
dustry committees. 

Equipment Design 

Concurrent with applications, research should be con- 
tinuing in equipment design. It should be universally 
accepted among business data processing specialists 
that information files are the centers or center from 
which data to form decision patterns must come. There- 
fore, the handling of file maintenance, file reference, and 
data organization should be the primary area of re- 
search. The other significant problem is the accurate, 
rapid capture of raw data as they occur. In addition to 
the physical devices needed to capture the data there is 
often the problem of transmitting the data to some 
central location. This problem is more or less simple de- 
pending on the distance involved and the format of the 
data. 

Most of this is and should be carried on by the vari- 
ous manufacturers. There is an economic cost to this of 
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course, and unless some of the research is done by en- 
dowed organizations, the immediate costs are likely to 
be high. 

There must also be a closer liaison between the digital 
data processing engineers and the communications en- 
gineers. As electronic data systems become more re- 
sponsive, communicating and transmitting devices will 
be needed to connect the data processing center with 
the various segments of the system. Terminal data 
transfer and translation problems must be solved to 
permit the ultimate automation of data manipulation 
that is logically feasible. 

Conclusion 

The demise of the medium and large-scale general 
purpose electronic data processor or computer for busi- 
ness purposes is in sight. A sufficient number of indus- 


trial and commercial procedural analysts are now able 
to specify their data system requirements with cog- 
nizance of the speed and ability of electronic devices so 
as to build what is needed — not use just what is avail- 
able. Small general purpose computers and large ca- 
pacity computers for scientific calculation will continue 
in long usage. 

Many large companies with special electronic data 
handling problems have found the traditional large man- 
ufacturers of business machines unwilling to do more 
than tie together existing standard lines of equipment. 
Often unwilling to entrust the smaller electronic manu- 
facturers with their problems, several companies have 
embarked on their projects of tailor-made electronic 
systems. 

I predict that this trend will continue until, or unless, 
some better-known companies enter the field. 


The Residue Number System 

HARVEY L. GARNERf 


Introduction 

I N THIS PAPER we develop and investigate the 
properties of a novel system, called the residue code 
or residue number system. The residue number sys- 
tem is of particular interest because the arithmetic op- 
erations of addition and multiplication may be executed 
in the same time as required for an addition operation. 
The main difficulty of the residue code relative to arith- 
metic operations is the determination of the relative 
magnitude of two numbers expressed in the residue code. 
The residue code is probably of little utility for general- 
purpose computation, but the code has many charac- 
teristics which recommend its use for special-purpose 
computations. 

The residue code is most easily developed in terms of 
linear congruences. A brief discussion of the pertinent 
properties of congruences is presented in the next sec- 
tion. 

Congruences 

The congruence relationship is expressed as 
A = a mod b 

which is read, A is congruent to a modulo b. The congru- 
ence states that 

A = a + bt 

f University of Michigan, Ann Arbor, Mich. 


is valid for some value of t, where A, a, b, and t are 
integers, a is called the residue, and b the base or modulus 
of the number A. 

As examples of congruences, consider 
10 = 7 mod 3 
10 = 4 mod 3 
10 = 1 mod 3. 

In these examples the integers 7, 4, and 1 form a resi- 
due class of 10 mod 3. Of particular importance is the 
least positive residue of the class which in this example 
is one. The least positive residue is that residue for 
which O^a^&d 

Consider the following set of congruences: 

Given 

Ax = (*! mod b 
A n = a n mod b. 

Then 

1) Congruences with respect to the same modulus 
may be added and the result is a valid congruence. 

n / n 

X Ai = ( a 

1 The equality sign may exist on only one side of the expression. 


( ) 


mod b. 
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It follows that terms may be transferred from 
one side of a congruence to the other by a change 
of sign and also that congruences may be sub- 
tracted and the result is a valid congruence. 

2) Congruences with respect to the same modulus 
may be multiplied and the result is a valid congru- 
ence. 

n A* = ( n 

i — 1 \ t=l 

It follows that both sides of the congruence may 
be raised to the same power or multiplied by a 
constant and the result is a valid congruence. 

3) Congruences are transitive. If A=B and B^=C, 
then A = C. 

4) A valid congruence relationship is obtained if the 
number, the residue, and the modulus are divided 
by a common factor. 

5) A valid congruence relationship is obtained if the 
number and the residue are divided by some com- 
mon factor relatively prime to the modulus. 

The material of this section has presented briefly, 
without proof, the pertinent concepts of congruences. 
Additional material on the subject may be found in any 
standard text on number theory. 2 

Development of the Residue Code 

A residue code associated with a particular natural 
number is formed from the least positive residues of the 
particular number with respect to different bases. The 
first requirement for an efficient residue number system 
is that the bases of the different digits of the representa- 
tion must be relatively prime. If a pair of bases are not 
relatively prime, the effect is the introduction of re- 
dundancy. The following example will illustrate this 
fact. Contrast the residues associated with bases of mag- 
nitude 2 and 6 against the residues associated with bases 
of magnitude 3 and 4. In the first case, the bases are not 
relatively prime while in the second case the bases are 
relatively prime. The residues associated with the bases 
of magnitude 2 and 6 are unique for only 6 states while 
the residues associated with the bases of magnitude 3 
and 4 provide a unique residue representation for 12 
states. This is further clarified by Table I. 

An example of a residue number system is presented 
in Table II. The number system shown in Table II uses 
the prime bases 2, 3, 5, and 7. The number system, 
therefore, contains 210 states. The 210 states may cor- 
respond to the positive integers 0 to 209. Table II shows 
the residue number representation corresponding to the 
positive integers 0 to 29. Additional integers of the 
number system may be found by congruence operations. 
Let a, b, c, and d be the digits associated with the bases 
2, 3, 5, and 7, respectively. The following congruences 

2 G. H. Hardy and E. M. Wright, “An Introduction to the 
Theory of Numbers,” Oxford University Press, London, Eng.; 1956. 


TABLE I 

Redundancy of a Nonrelatively Primed Base Representation 


Least Postive Residue 


Number 

Mod 2 

Mod 6 

Mod 3 

Mod 4 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

2 

0 

2 

2 

2 

3 

1 

3 

0 

3 

4 

0 

4 

1 

0 

5 

1 

5 

2 

1 

6 

0 

0 

0 

2 

7 

1 

1 

1 

3 

8 

0 

2 

2 

0 

9 

1 

3 

0 

1 

10 

0 

4 

1 

2 

11 

1 

5 

2 

3 

12 

0 

0 

0 

0 

13 

1 

1 

1 

1 

14 

0 

2 

2 

2 


TABLE II 

Natural Numbers and Corresponding Residue Numbers 


Natural 

Numbers 

2357 

Natural 

Numbers 

2357 

Natural 

Numbers 

2357 

0 

0000 

10 

0103 

20 

0206 

1 

mi 

11 

1214 

21 

1010 

2 

0222 

12 

0025 

22 

0121 

3 

1033 

13 

1136 

23 

1232 

4 

0144 

14 

0240 

24 

0043 

5 

1205 

15 

1001 

25 

1104 

6 

0016 

16 

0112 

26 

0215 

7 

1120 

17 

1223 

27 

1026 

8 

0231 

18 

0034 

28 

0130 

9 

1042 

19 

1145 

29 

1241 


TABLE III 

Number of States and Digits Associated with a Residue 
Representation 


i 

Pi 

if. 

i=l 

n a 

i = 1 

Pi 

bits 

Z.Pi 

bits 

1 

2 

2 

2 

1 

1 

2 

3 

5 

6 

2 

3 

3 

5 

10 

30 

3 

6 

4 

7 

17 

210 

3 

. 9 

5 

11 

28 

2,310 

4 

13 

6 

13 

41 

30,030 

4 

17 

7 

17 

58 

510,510 

5 

22 

8 

19 

77 

9,699,690 

5 

27 

9 

23 

100 

223,092,670 

5 

32 


define a, b, c, and d for the residue representation of the 
number N: 

N = a mod 2 
N = b mod 3 
N = c mod 5 
N = d mod 7. 

The residue number system is readily extended to 
include more states. For example, if a base 11 is added 
to the representation, it is then possible to represent 
2310 states. Table III shows the product and sum of the 
first nine consecutive primes greater than or equal to 2. 




mod b. 
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The product of the primes indicates the number of 
states of the number system, while the sum of the 
primes is a measure of the size of the representation in 
terms of digits. Table III also includes the number of 
bits required to represent each prime base in the binary 
number system. 

Residue Addition and Multiplication 

The residue number representation consists of sev- 
eral digits and is assumed to be in one-to-one corre- 
spondence with some positive integers of the real number 
system. The digits of the residue representation are the 
least positive residues of these real positive integers 
with respect to the different moduli which form the 
bases of the residue representation. It follows as a direct 
consequence of the structure of the residue number sys- 
tem and the properties of linear congruences that opera- 
tions of addition and multiplication are valid in the resi- 
due number system subject to one proviso: the residue 
system must possess a number of states sufficient to 
represent the generated sum or product. If the residue 
number system does not have a sufficient number of 
states to represent the sums and the products generated 
by a particular finite set of real integers, then the residue 
system will overflow and more than one sum or product 
of the real number system may correspond to one resi- 
due representation. For a residue number with a suffi- 
cient number of states, an isomorphic relation exists 
with respect to the operations of addition and multipli- 
cation in the residue system and a finite system of real 
positive integers. 

Each digit of the residue number system is obtained 
with respect to a different base or modulus. It follows, 
therefore, that the rules of arithmetic associated with 
each digit will be different. For example, the addition 
and multiplication of the digits associated with moduli 
2 and 3 follow rules specified in Table IV. No carry 
tables are necessary since the residue number system 
does not have a carry mechanism. Addition of two resi- 
due representations is effected by the modulo addition 
of corresponding digits of the two representations. Cor- 
responding digits must have the same base or modulus. 
Modulo addition of digits which have different bases is 
not defined. Multiplication in the residue system is ef- 
fected by obtaining the modulo product of correspond- 
ing digits. The operations of addition and multiplica- 
tion of two residue numbers are indicated by the follow- 
ing notation: 

S = A 0 B 
p = A O B. 

Consider a residue number representation with bases 
2, 3, 5, and 7. We assume an isomorphic relation between 
the residue number system and the real positive num- 
bers 0 to 209. An isomorphic relation then exists for the 
operations of multiplication and addition only if the 
product or sum is less than 210. The following examples 


TABLE IV 


Mod 2 and Mod 3 Sums and Products 


© 

0 

1 








0 

0 

1 

© 

0 

1 

2 

O 

0 

1 2 

1 

1 

0 

0 

0 

1 

2 

0 

0 

0 0 











— 

1 

1 

2 

0 

1 

0 

1 2 

o 

0 

1 









2 

2 

0 

1 

2 

0 

2 1 

0 

0 

0 

sum mod 3 


product mod 3 

1 

0 

1 








product mod 2 









employing residue numbers illustrate the addition and 
multiplication operations and the presence of an iso- 
morphism or the lack of isomorphism in the case of over- 
flow. Residue numbers will be distinguished by the use 
of parentheses. 

29 + 27 = S = 56 

29 > (1 2 4 1) 

27^ (1 0 2 6) 

56^(0 2 1 0) 

(12 4 1) 

0 

(1 0 2 6 ) 

(0 2 1 o7 ' 

The following’operations are considered in performing 
the addition of the two residue representations: 

1 + 1 = 0 mod 2 " v 

2 + 0=2 mod 3 
4+2=1 mod 5 
1 + 6 = 0 mod 7. 

Consider the addition of two numbers which produce a 
sum greater than 209. 

S = 100 + 200 
„ (0 1 o 2) 

(0 2 0 4) 

(0 0 0 6 )’ 

The residue representation (0 0 0 6) corresponds to the 
real positive number 90. In this particular example, the 
sum has overflowed the residue representation. The re- 
sulting sum is the correct sum modulo 210. 

300 = 90 modulo 210. 

Finite real number systems and residue number systems 
have the same overflow characteristics. The sum which 
remains after the overflow is the correct sum with re- 
spect to a modulus numerically equal to the number of 
states in the finite number system. 
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The following is presented as an example of the proc- 
ess of residue multiplication: 

p = 10 X 17 = 170 (0 1 0 3) 

10o(0 1 0 3) (1 2 2 3) 

17 <-> (1 2 2 3) (0 2 0 2) * 

170 <-> (0 2 0 2) 

The process of multiplication involved consideration of 
the following relations for each digit: 

1X0 = 0 mod 2 
1X2 = 2 mod 3 
0X2 = 0 mod 5 
3X3 = 2 mod 7. 

An overflow resulting from a multiplication is no dif- 
ferent from the overflow resulting from an addition. 
Consider the product obtained from the residue multi- 
plication of the numbers 10 and 100. The result in the 
modulo 210 number system is 160, since 

1000 = 160 mod 210. 


The following examples have been chosen to illustrate 
the subtraction process and to some extent the diffi- 
culties associated with the sign of the difference: 

D=AQB=A®B'. 


We consider first the case where the magnitude of A is 
greater than B. 


Let A = 200 B = 100. 


In residue representation, 

B' = (0 2 0 5) 


and 


(0 2 0 4) 
(0 2 0 5) 
(0 10 2 ) 


The residue representation of the difference corresponds 
to positive 100 in the real number domain. We consider 
next the case where the magnitude of B is greater than 
the magnitude of A. 

A' = ( 0 10 3) 


Subtraction and the Representation of 
Negative Numbers 

The process of subtraction is obtainable in the residue 
number system by employing a complement representa- 
tion consisting of the additive inverses of the positive 
residue representation. The additive inverse always ex- 
ists, since each of the elements of the residue representa- 
tion is an element of a field. There is no basic problem 
assodiated with the subtraction operation. There is, 
however, a problem associated with the representation 
of negative numbers. In particular, some mechanism 
must be included in the number system which will per- 
mit the representation of positive and negative numbers. 
This problem is discussed here and in the following sec- 
tion. 

The additive inverse of a residue number is defined by 
the following: 

a © a' = 0. 


The formula may be considered to apply to a digit of 
the residue system or equally well to the whole residue 
representation. Consider the following examples with 
reference to the modulo 210 residue number system: 


then 


since 


a = (1 2 4 1) 

a' = (1 1 1 6), 

(! 2 4 i ) 

8 (1 1 1 6 ) 

(0 0 0 0 ) 


then 


D = A' © B 


and 

(° 1 0 3 ) 

(0 10 2 ) 

(0 2 0 5) 

The difference (0 2 0 5) is the additive inverse of 
(0102). Unless additional information is supplied, the 
correct interpretation of the representation (0 2 0 5) is 
in doubt. (0 2 0 5) may correspond to either +110 or 
- 100 . 

The difficulties associated with whether a residue rep- 
resentation corresponds to a positive or negative integer 
can be partially removed by the division of the residue 
number range into two parts. This is exactly the scheme 
that is employed to obtain a machine representation of 
positive and negative natural numbers. For the system 
of natural numbers, two different machine representa- 
tions of the negative numbers may be obtained and are 
commonly designated the radix complement represen- 
tation of negative numbers and the diminished radix 
complement representation of negative numbers. 

The complement representation for a residue code is 
defined in terms of the additive inverse. Thus, the 
representation of negative A is A' where A ®A' = 0, 
and the range of + is restricted to approximately one 
half of the total possible range of the residue represen- 
tation. This can be illustrated by consideration of a 
specific residue code. This residue representation em- 
ploying bases of magnitude 2, 3, 5, and 7, is divided into 
two parts. The residue representations corresponding 
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to the natural numbers 0 to 104 are considered positive. 
The residue representations corresponding to the 
natural numbers 105 to 209 are considered inverse 
representations and associated with the negative inte- 
gers from —1 to —105. The range of this particular 
number system is from —105 to +104. The arithmetic 
rules pertaining to sign and overflow conventions for 
this particular number system are the same rules nor- 
mally associated with radix complement arithmetic. 

The complement representation does eliminate in 
principle any ambiguity concerning the sign of the re- 
sult of an arithmetic operation. However, there is a 
practical difficulty. The determination of the sign as- 
sociated with a particular residue representation re- 
quires the establishment of the magnitude of the repre- 
sentation relative to the magnitude which separates the 
positive and negative representations. The determina- 
tion of relative magnitude for a residue representation 
is discussed in the next section, where it is shown that 
the determination of relative magnitude is not a simple 
problem. 


where 



m = XI m i • 

3=1 

The conversion formula for a particular residue number 
system is now obtained. 


mi — 2 m 2 = 3 

m 3 — 5 

105 Ax = 1 mod 2 

SO d i = 

70 A 2 = 1 mod 3 

SO A 2 = 

42 As = 1 mod 5 


2 s 1 mod 5 

so A 3 = 

30 A 4 = 1 mod 7 


2 A 4 = 1 mod 7 

so A 4 = 


105 a\ + 70 #2 + 126 a s + 120 a± = S mod 210. 


Conversion from a Residue Code to a 
Normal Number Representation 

It is frequently desirable to determine the natural 
number associated with a particular residue represen- 
tation. The need for this conversion occurs frequently 
in investigation of the properties of the residue system. 
The residue representation is constructed in such a 
manner that magnitude is not readily obtainable. The 
presence of digit weights in the normal polynomial type 
number representation greatly facilitates the deter- 
mination of magnitude. However, it is possible to assign 
a weight to each digit of the residue representation in 
such a manner that the modulo m sum of the digit- 
weight products is the real natural number in a con- 
sistently weighted representation, m is the product of 
all the bases employed in the residue representation. 
The conversion technique is known as the “Chinese 
Remainder Theorem.” The material which follows de- 
scribes the conversion technique but omits the proof. 
A simple and straightforward proof is found in Dickson. 3 
The proof does not refer specifically to residue number 
systems, but rather to a system of linear congruences. 
If so regarded, a system of congruences defines a com- 
ponent of a residue number system. 

Consider a residue number system with bases 
m • • • m t . The corresponding digits are labeled 
ax • • • a t . The following equations define the conversion 
process : 

m m 

axAx {-•••+ a t A t — = S mod m 


* L. E. Dickson, “Modern Elementary Theory of Numbers,” 
University of Chicago Press, Chicago, 111., p. 16; 1939. 


The conversion formula is now used to determine the 
natural number corresponding to the residue representa- 
tion (1 2 0 4). 

105 (1) + 70 (2) + 126 (0) + 120 (4) = 725 

725 = 5 mod 210 
5 = 95. 

The conversion process described above requires con- 
ventional multiplication and modulo addition. 

Other conversion techniques exist. In particular it is 
possible by means of a deductive process to determine 
the magnitude of a particular residue representation. 
This requires both a knowledge of the nature of the 
residue system and the natural number representation 
associated with at least one residue representation. 

Due to the deductive nature of the process, it is more 
suitable for human computation than for machine com- 
putation. The process is explained using the residue 
number of the previous example (1 2 0 4). The knowl- 
edge of the residue representation for unity which is 
(1111) is assumed. Consider the effect of changing the 
second digit from one to two. The change adds the 
product miW 3 W 4 = 70 to the number, since 70 is con- 
gruent 1, modulo 3. The resulting residue representa- 
tion (12 11) corresponds to 71. The effect of changing 
the third digit is to change the magnitude by some 
multiple of the product mim 2 W 4 = 42. The correct change 
in magnitude is 42x where 42x = 4 mod 5 ; so 42x = 84 and 
the residue representation (12 0 1) corresponds to 155. 
The fourth digit is modified by the addition of a three. 
The effect of this change is determined by 30x^3 mod 7. 
The magnitude change is 150. The sum of 150 and 155 
modulo 210 yields the correct result 95, in correspond- 
ence with (1 2 0 4). 
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Sign determination for the residue code is dependent 
on the determination of a greater than or less than re- 
lationship. A possible method might involve the con- 
version techniques described previously. Such a scheme 
would involve the standard comparison techniques as- 
sociated with the determination of the relative magni- 
tude of two numbers represented in a weighted repre- 
sentation. An alternate conversion procedure yields a 
conversion from the residue code to a nonconsistently 
based polynomial number representation by means of 
residue arithmetic. Consider a residue code consisting 
of t digits. The t digits of the residue code are associated 
with t congruence relationships as follows: 


The system of equations shown below is generated by 
repetition of the above substitution process until no 
equations remain. 

S = a t + A t m t 

t— i 

A t = d t + At-imt-i 

t- 2 

A t ~ i = d t ~ i + A t -<0i t -2 

A 3 = d s 2 + A 2 m 2 
A 2 = d 2 mod mi 

The equations are combined to yield 


.5 = ai mod 1 < i < /. 

S is the magnitude of the number expressed in normal 
representation. It is also possible to express the number 
5 as 

S = ai + A {Mi. 

At is the integer part of the quotient of 5 divided by w,-. 
In regard to a greater or less than relationship, the de- 
termination of A i divides the range of the residue repre- 
sentation into m/mi parts. We proceed to calculate Ai 
from the set of t equations given above. Let 

m 

S = a t + A m t A t < — • 

m t 

This equation is then used to replace 5 in the remaining 
t — 1 equations, yielding t — 1 equations of the form 

A t m t = {ai + a t r ) mod 1 < i < t — 1 


or 

A t = ( ai + at)/m t * mod w* 

At = d t * mod mi 

where /m t * is the multiplicative inverse of m t with re- 
spect to base m*. The multiplicative inverse is defined as 4 

Xt/xt* = 1 mod mi. 

d t * is the least positive residue of (af+a/)/w t with 
respect to base i. 
a t ' is the additive inverse of a t . 

Let A t be expressed as 

m 

At = dt 1 + A t -\mt-\ At - 1 < • 

m t m t - 1 

If this expression is substituted for A t a set of if — 2 equa- 
tions remain. The equations are of the form 

A t - 1 = [dj + (dd -1 )']/mi_i mod m % 1 < i < t — 2 
A t -i = d* t - 1 mod 

4 The existence of the multiplicative inverse reauires that xt and 
mi be relatively prime. 


/ t— 1 r t— 2 t—z 

S = a t + m t \dt + m t -i[dt-i + m t - 2 \dt -2 + • • • 

t— 1 t—2 t— Z 

= a t + m t d t + m t m t -idt-i -f- m t mt-im t - 2 dt -2 + 


m 

H d 2 1 

Ml 

where 


A t < m t 

t—n— 1 

dt—n ^ rrit—n—i- 

Therefore, A is never equal to or greater than m and 
oV divides the range into «i parts, d 3 2 divides each of 
the mi parts into m 2 parts, dT divides each of the m 2 
parts into m% parts, etc. 

The determination of the less than or greater than 
relationship consists of the successive comparison of 
the d\ I” -1 constants corresponding to two residue 
representations. Let the representations be designated 
E and F. The first step of the greater than or less than 
determination is the comparison of d</-{E) and d 2 l {F) s . 
If the two constants are different the process may be 
terminated and the larger number is associated with the 
larger constant. If the constants are equal in value, the 
comparison process must consider the pair of constants 
d 3 2 (E) and d 3 2 (A). The process is continued in this 
manner until a set of nonidentical constants is found. 
If all of the d constants are identical, a final comparison 
is made on the basis of the pair of Ah digits of the two 
residue representations. 

The formulas which define the greater than, less than 
process may be applied recursively to obtain a formula 
for a greater number of digits. The process has been 
extended to five variables and the results are shown in 
Fig. 1. 

Admittedly, the process required to obtain a greater 
than or less than relationship leaves much to be desired. 
One presumed advantage of the residue number system 
was the absence of a carry process. The greater or less 
than process is essentially sequential and is in many 
ways similar to the carry process of ordinary arithmetic. 
The ultimate usefulness of the residue code for general- 
purpose computation appears very much dependent on 
the development of simple techniques for the determina- 
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tion of the relative magnitude of two residue code 
digits. 

Division 

The division process for residue codes is complicated 
by two factors. The first is the absence of a multiplica- 
tive inverse for the zero element. The second difficulty 
is the fact that residue division and the normal division 
process are in one to one correspondence only when the 
resulting quotient is an integer value. We shall consider 
first the problem of residue division of the elements of 
a single field and shall consider later the elements of 
several fields considered as a residue code. The division 
process represented in equation form as 


a 



implies 

a — bq. 

The difference between normal arithmetic and residue 
arithmetic is that in residue arithmetic the product bq 
need not necessarily be equal to a; only the congruence 
of a and bq is required. 


bq = a mod m n 

Multiplication by the multiplicative inverse of b desig- 
nated /b obtains 

q = a/b mod m n . 

The correct interpretation of q in the above equation is 
that the number a is obtained by forming the modulo 
sum consisting of b, q representations. The sum is car- 
ried out in a closed and finite modulo number system 
of base m n . Thus, q corresponds to the quotient only 
when the quotient has an integer value. Examples may 
be obtained from the consideration of a modulo 5 
number system: 


4 



2q = 4 mod 5 
q = 2 mod 5 
4 



3q = 4 mod 5 
q = 3 mod 5 
note 3X3 = 4 modfj5 
3 



4 5 = 3 mod 5. 
q = 2 mod 5. 

In the above examples, q corresponds to the quotient 
only in the first example. 

The residue code representation of a number consists 
of many digits, A — {a x , a 2 , • • • , a n ). Each digit of the 
representation is associated with a different prime base. 
The number system is a modulo m system where 

n 

m = XI m % • 

i— 1 

The division of two numbers in residue code may be 
expressed by a system of congruences. The solution 
Q = (qu qi, • • • , q n ) must satisfy all the congruence re- 
lationships of the system. A zero digit in the divisor 
B = (&i, & 2 , • • • , b n ) means that B and m are not rela- 
tively prime hence the multiplicative inverse of B does 
not exist. 

QB ^ A mod m. 

For the special case in which bi — 0 and ai = 0, a valid 
congruence relationship of the form 

QB A m 
— - = — mod — 

mi mi mi 

is obtainable. 
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The process of residue division has certain interesting 
properties and quite possibly has applications in respect 
to special problems. Unfortunately, the residue division 
process is not a substitute for normal division. It ap- 
pears that the only way in which division can be effected 
in the residue code is by the utilization of techniques 
similar to those employed for division in a consistently 
weighted number system. The division process then 
requires trial and error subtraction or addition and the 
greater than or less than relationship. The division 
algorithm could also include trial multiplication, since 
in the residue system addition and multiplication re- 
quire the same period of time. 

Conclusions 

The material of this paper forms a preliminary in- 
vestigation of the applicability of residue number sys- 
tems to the arithmetic operations of digital computers. 
The residue system has been found attractive in terms 
of the operations of multiplication and addition. It is 
possible to realize practical logical circuitry to yield the 
product in the same operation time as for the sum, since 
the product is not obtained by the usual procedure qf 
repetitive addition. The main disadvantages of the 
residue number system are associated with the necessity 
of determining absolute magnitude. Thus, the division 
process, the detection of an overflow, and the determi- 
nation of the correct sign of a subtraction operation are 
processes which at this stage of the investigation seem 
to involve considerable complexity. Nevertheless, many 


special-purpose applications are certainly well-suited to 
the residue code. In particular, there exists a class of 
control problems characterized by the absence of the 
need for division and the existence of a well-defined 
range for the variables, and also by the fact that the 
sign of the variables is known. For the problems of this 
class, the use of the residue code should result in a re- 
duction of the over-all computation period and should 
yield a computer with a higher bandwidth than obtain- 
able with the conventional number system. 

The ultimate usefulness of the residue code will 
probably be determined largely by the success of the 
circuit designer in perfecting circuitry ideally suited for 
residue code operations. 

The material of this paper is essentially Chapter 5 
of the author’s doctoral dissertation. 5 At the time of 
the completion of the dissertation, the author was un- 
aware of the work of M. Valach 6 and A. Svoboda 7,8 in 
Czechoslovakia. Additional literature 6-8 was obtained 
from recent visitors to the Soviet Union. The author 
wishes to take this opportunity to acknowledge the 
work of Valach and Svoboda. 

5 H. L. Garner, “Error Checking and the Structure of Binary 
Addition,” Ph.D. dissertation, University of Michigan, Ann Arbor, 
pp. 105-140; 1958. 

6 M. Valach, “Vznik kodu a ciselne soustavy zbytkovych trid,” 
Strop Na Zpracovani Informaci, Sbornik III; 1955. 

7 A. Svoboda and M. Valach, “Operatorove obvody,” Stroje Na 
Zpracovani Informaci, Sbornik III; 1955. 

8 A. Svoboda, “Rational numerical system of residual classes,” 
Stroje Na Zpracovani Informaci, Sbornik V; 1957. 


System Evaluation and Instrumentation for Military 
Special-Purpose Digital Computer Systems 

A. J. STRASSMANf and L. H. KURKJIAN f 


Introduction 

T ESTING and instrumentation are essential pre- 
requisites for the completion and operation of any 
new system. A system can be defined as a number 
of components that are amalgamated or integrated to- 
gether to perform a desired operation. Throughout this 
paper a “component” is considered to be a complete 
functional part of a data processing system such as an 
arithmetic unit or a buffer. To ascertain if a component 
in the system is going to perform correctly its specific 
function, it is sometimes necessary for the implementa- 

f Hughes Aircraft Co., Fullerton, Calif. 


tion of tests to be more complex than the component 
undergoing the testing. This becomes apparent when 
the component is a part of a large system and has many 
inputs and outputs. 

To prove the system feasibility or operation of the 
components it is necessary to do either of two things: 
1) duplicate and maintain an entire system and use it as 
one master test fixture to evaluate each functional com- 
ponent; or 2) provide individual test facilities for the 
evaluation of each of the functional components. The 
second approach requires the design of simulation equip- 
ment to provide the necessary inputs (control signals 
arid data) to check out completely the operation of each 
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individual component. It is believed that this approach 
offers the greatest advantages for large special-purpose 
digital computer systems. 

System Organization Determines Work 
Organization 

It is necessary to provide the proper work organiza- 
tion for the evaluation of these computer systems. A dif- 
ferentiation can be made between small and large sys- 
tems and the work organization can be adjusted accord- 
ingly. Although the basic philosophy of test remains the 
same, the details evolved for the testing or evaluation of 
a small system will be different from that evolved for a 
large system. Since the flip-flop is a basic element in 
many digital computers, the number of flip-flops can be 
used as an indication of the size and complexity of the 
system. For the purpose of this paper, in which the eval- 
uation and instrumentation of a large system is de- 
scribed, a “large” system is defined arbitrarily as one 
that contains more than 500 flip-flops. 

In the case of a small system, all the work can be 
handled by a small group which will perform the neces- 
sary tasks from system design to final evaluation. Fig. 
1 (a) shows the work flow for the “small system” organ- 
ization. The work usually begins with a proposal out- 
lining the new system. This is followed by system de- 
sign, logical design, circuit design, testing, evaluation, 
and delivery. It is admitted that to be able to perform 
all the tasks included, the technical personnel associated 
with a small system must have a broader background 
than those required for the evaluation of a large system. 

The actual limitations of time, complexity of large 
systems, and efficiency of utilization of personnel, neces- 
sitates the use of specialists in each specific work area to 
perform all the necessary tasks to complete the large 
system. Specialization is indicated by the fact that the 
system design is done by a group of systems engineers 
whose function is to define the necessary components 
needed to implement the system and their interrelation- 
ships. Logical design and circuit design are two func- 
tions that are performed in an analogous manner by 
logicians and circuit engineers who are specialists in 
their respective realms. The test and evaluation of the 
system is also handled in a specialized manner. Each 
component is assigned to a circuit or unit engineer whose 
responsibility is to 1) design the logical circuitry of the 
component from the Boolean equations, 2) design the 
test fixture, 3) write the necessary procedurals, and 4) 
test the component when it is fabricated. 

Since a large system requires many individuals to 
complete these tasks, the question of communication be- 
tween groups of specialists becomes a problem. This is 
an important item especially when the decision of one 
group vitally affects the work of another group or 
groups. To maintain a continuous flow of data in both 
directions which enables each group to perform their 
tasks more efficiently, information feedback loops are 
provided in the form of signature control and written 
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Fig. 1 — Work organization for typical small (a) and large (b) systems. 


reports. Fig. 1(b) shows the interrelationship of all 
groups from system design to delivery. 

The philosophies contained within this paper led to 
the basic planning considerations for the test and evalu- 
ation of a large special-purpose military data processing 
system, parts of which will be described in later para- 
graphs. This data processing system contains approxi- 
mately 1500 tubes, 2500 transistors, 250,000 diodes, and 
3500 flip-flops. Each flip-flop in the system has four 
transistors, making a total of 16,500 transistors in the 
entire system. This qualifies the described system to be 
classified as a large system. 

Types of Tests to be Performed 
for Evaluation 

The basic parts of any large system can be broken 
down into five categories which are listed in their order 
of complexity: 1) elements, 2) units, 3) components, 4) 
subsystems, and 5) systems. If these basic parts of the 
system are evaluated and tested in order of complexity, 
the sequential building of testing integrity gives us the 
understandable advantage of solving small problems 
first before becoming involved with the intricacies and 
troubles inherent in any large system. The first type of 
tests to be performed therefore would be element tests. 

Element Tests 

The basic computing elements are usually flip-flops, 
logical followers, drivers, shift registers, diode cards, 
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Fig. 2 — Standard elements and standard 
element test fixture. 


and pulse regenerators, etc. These items are referred to 
as “standard elements.” The functional requirements of 
each of these standard elements determine the design 
of the test fixture necessary for evaluation. The test 
fixture for a flip-flop contains the necessary steering cir- 
cuitry which makes the flip-flop a modulo 2 counter. 
When more than one flip-flop is built on a standard card, 
the fixture can be expanded to make the flip-flops count 
in any prescribed manner. Shift registers, followers, and 
drivers most commonly are evaluated by inserting spe- 
cific computer word patterns at the input and observing 
the appropriate outputs. Simple sequential relay cir- 
cuits are used to step through the forward and reverse 
characteristics of diodes on standard diode cards. Typi- 
cal standard elements of a digital computer system 
along with a standard element test fixture are shown in 
Fig. 2. 

The electronic implementation of the digital cbm- 
puter logic that is represented in Boolean notation is 
formed from the standard diode card and specific re- 
sistor networks on the matrix card assembly. The wiring 
of the resistors to the diodes on each individual matrix 
card determines its logical function. The logical function 
of each card can be statically evaluated by a “matrix 
card tester” which simulates each input term to the 
card. The output of each gate is monitored as logical 
true and false levels are placed at the inputs to the gate. 
A meter is used to indicate to the operator the result of 
the simulation of any of the logical terms under test. Fig. 
3 is a photograph of a matrix assembly and a matrix 
card tester. 

Unit Tests 

Each module of the system under discussion is called 
a unit and contains up to twenty-one element cards. A 
unit module is shown in Fig. 4. The combinations of, 
and connections between, the elements in the unit pro- 
vide a portion of an over-all computing function that is 
to be performed by the system. Evaluation of units is 
difficult because units are incomplete functional items 



Fig. 3— Matrix assembly and matrix tester. 



Fig. 4 — Unit module. 


and therefore the amount of simulation becomes large 
and complex. However, it is considered that this step in 
the system evaluation is critical. It is therefore neces- 
sary to ascertain that each unit has been tested to th 
maximum. This obligates us to perform the most ex- 
haustive tests possible on the unit level within the 
framework of the computer. Provisions to accomplish 
this can only be done by generating ideal simulated sig- 
nals that the unit would expect in system operation. 
This type of simulation has been achieved by the design 
of equipment referred to as the “unit tester.” The unit 
tester provides combinations of static and dynamic sig- 
nals to the unit under evaluation. All system timing sig- 
nals, synchronizing signals, and data inputs are gen- 
erated in the unit tester. Each connection in and out of 
the unit under test is available on a patch panel on the 
unit tester. The choice of four signals is available at each 
point. The point may be 1) connected to a logical true 
signal, 2) connected to a logical false signal, 3) con- 
nected to a special function (sync, timing, data, etc.), or 
4) unconnected if it is an output of the unit that is to 
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be observed. The unit test insures that the interelement 
wiring and the input-output wiring of the unit is correct 
and at the same time provides a semidynamic test to 
the various element configurations. Many of the logical 
functions can be completely evaluated during this phase 
of test. The unit test can be easily modified for produc- 
tion testing of each module by simple automation tech- 
niques. In Fig. 5 a unit is shown under test with the 
unit tester. / 

Component Tests 

A system component is a unit or group of units that 
has been defined in the system to perform a particular 
computing or data processing function. Examples of 
typical components are the arithmetic unit, the various 
buffers, the computer controls, and the buffer controls. 
It is at this level that the simulation of external signals 
is very important, as the completeness of testing at the 
component level determines the ease with which it is 
possible to test and evaluate the entire system. The 
component test provides for the testing of all of the 
logic contained within the integrated units by means of 
external simulated signals. These external signals are 
developed by a special test fixture that is unique for 
each component. A component consisting of eight units 
mounted on its test fixture is illustrated in Fig. 6. The 
test fixture is designed to simulate the complete system 
to the component. This test is basically dynamic and as 
a consequence logical errors can be discovered during 
this phase of evaluation. The simulation equipment 
consists of the appropriate switches, function gen- 
erators, and timing and synchronizing signals that the 
component would operate from if it were in the system. 
Procedurals are written which outline the detailed steps 
necessary to evaluate the component function as speci- 
fied by the system design. This test actually proves or 
disproves the component logic with the test fixture as 
the system simulator. Both the test fixture and the pro- 
cedurals are designed and written by the cognizant cir- 
cuit or unit engineer who is charged with the responsi- 
bility of this component and has by necessity a com- 
plete grasp of the functional operation of this compo- 
nent. Typical examples of system components and an 
idea of their complexity are given in the following para- 
graphs. 

1) The coordinate extrapolator updates coordinates 
on the basis of velocity stored in the memory. This com- 
ponent requires twelve flip-flops, eight logical followers, 
and 180 diodes. The logic written in Boolean notation 
consisted of three typewritten pages and the test pro- 
cedural was nine pages long. The control and addition 
logi'c in this component were evaluated by means of a 
component test fixture which simulated the system in- 
put coordinate data, velocity, and time by means of 
variable word generators and counters. 

2) The computer control is a special-purpose wired 
program computer that controls information from and 
to three arithmetic units. Its outputs include control 



Fig. 6 — Component test fixture. 


signals and generation of appropriate constants needed 
during the various steps in the wired program. The com- 
ponent was implemented with 15 units which contained 
148 flip-flops, 384 logical followers, and 6350 gating 
diodes. The logical equations in Boolean notation com- 
prised 26 typewritten pages, and the test procedural was 
1 14 pages long. The control signals and the terms of the 
constant generators were checked by a test fixture which 
simulated the essential control signals required to cau.se 
the computer to perform each program step. 
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Fig. 7 — Typical subsystem block diagram. 

Subsystem Test 

After the component has been completely evaluated, 
the next step for system completion is to integrate the 
components together into the various subsystems as 
determined by a logical sequential build-up. Fig. 7 
demonstrates an integration of one subsystem consist- 
ing of four components. Simulation equipment needed 
in this phase is less than during component test. The 
example shows control and decoder components that 
have the facility for entering data into a special-purpose 
computer which steps through a wired program cycle 
and stores information of a magnetic drum. Parts of 
this information are used in the control component 
during system operation. This makes the sub-system 
a small closed loop within the system. Logical tie-in 
and timing errors can be found and solved during 
this part of system completion. Simulation equipment 
for subsystem test usually consists of inhibiting signals 
that affect the closed loop operation and generate all 
those other signals which are necessary to make the 
loop operate. In the example shown, X and Y coordinate 
data in Gray Code, simple operator control buttons, 
and radar antenna position signals were the only signals 
needed to be simulated. Parts of existing component 
test fixtures can be used during subsystem test as they 
contain the necessary simulation equipment. 

System Test and Evaluation 

This phase is the culmination of all the test and evalu- 
ation effort that has been performed previously. All the 
elements, units, and components have been proved to 
perform within the framework of the several subsystems 
and now it is necessary to prove complete system opera- 
tion. This is done in two phases. Since the final military 
installation is a vehicle that has limited working space, 
a laboratory mock-up is provided. This mock-up as 
shown in Fig. 8 simulates the trailer installation as 



Fig. 8 — System mock-up fixtures. 



Fig. 9 — Final system installation. 


closely as possible, yet provides ample working space for 
many more of the engineering personnel so that much 
system testing and trouble shooting can be carried on 
simultaneously in many areas of the system. An addi- 
tional advantage gained by this 2 -step operation is pro- 
vided by the ability to modify the final wiring installa- 
tion as required as problems are encountered in the 
mock-up test phase. All system errors will be discovered 
in the mock-up phase and corrections can be made to 
the equipment before installation into the vehicle. Upon 
completion of the tests in the mock-up area, the equip- 
ment is then transferred to the vehicle and the complete 
system can be integrated with a minimum of personnel 
due to the fact that the system has been completely 
tested and all system errors removed. In fact, the only 
difficulties to be encountered are wiring errors caused by 
human inefficiencies. The vehicle interior working area 
is shown in Fig. 9. 

In the process of the design of the laboratory test 
equipment, many of the simulation concepts evolved 
are readily useful and on occasion can be incorporated 
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into the system as self-test features. This equipment can 
be utilized in the initial system evaluation as well as 
later during normal test modes of system operation. 
Since the end user of this equipment will be military 
personnel, many self-test and automatic indicator de- 
vices were incorporated to decrease the training require- 
ments for operation and maintenance of this equipment. 
Another requirement that is very often specified for 
military equipment is that of providing operation for 
23 out of 24 hours. This fact dictates the requirement 
for having a very rapid means of performing operational 
and preventive maintenance checks by semiskilled 
personnel. 

Military requirements include a controlled complete 
system test to prove that the system meets the initial 
specification. A comprehensive system test plan is most 
often written by the system engineers to test all the 
functions of the system. During this test only external 
system inputs must be simulated; following the test, the 
system is ready for operational use and field evaluation. 

System Test Personnel Training 

The previously mentioned steps in providing for se- 
quential testing of all components up to the complete 
integration for system test and evaluation allow certain 
personnel to acquire gradually system knowledge neces- 
sary to perform efficiently and rapidly the complex task 
of testing such a large system. It is obvious that no one 
individual, no matter how magnificently endowed with 
mental powers, can be expected to understand all neces- 
sary details of such a complex system containing equip- 
ments involving such diverse fields as displays, conver- 
sion, and data processing. A plan was evolved for certain 
specialists to become facile in the over-all system con- 
cepts, yet utilize the certain portions of their specialty 
to a large extent as possible. This control is achieved in 
the following manner: in the initial design phases, each 
component or allied group of components is assigned to 
a cognizant circuit engineer whose responsibility during 
this phase is to design the logical circuitry for the com- 
ponent, or in the case of the display subsystem, to imple- 
ment the original specifications. Once these data have 
been released for equipmenting and packaging, this 
same engineer proceeds to design and build the neces- 
sary unique test equipment for. component evaluation. 
As part of this task the engineer also writes the pro- 
cedurals to be followed in the testing of the component. 
This is the first step in causing the engineer to investi- 
gate external requirements of the component assigned 
to him. As the component is integrated into a subsystem 
area, it is necessary for the engineer to become more 
familiar with the input-output requirement of the adja- 
cent components, so that, while he remains a specialist 


for his assigned component, his system knowledge must 
perforce increase because of interdependency of the com- 
ponents in the subsystem. Since all components have 
been completely tested, there need be only one engineer 
now assigned to each subsystem. The remaining com- 
ponent engineers, however, are available for consulting 
as needed. When all the components are finally inte- 
grated as a system, there remain but a few engineers 
necessary for systems testing, each with a broad knowl- 
edge, rather than many component engineers with lim- 
ited specialized knowledge. Final installation can be 
completed more efficiently with a minimum of personnel. 

Conclusions 

It is apparent that any complex system can be tested 
and evaluated by a step-by-step instrumentation. Pro- 
viding the necessary special-purpose instrumentation 
has proved to be more rapid and economical than the 
accumulation of general-purpose testing devices. In the 
testing of special-purpose computer components within 
a system, there are many instances in which general- 
purpose instrumentation devices would not suffice, no 
matter how much and how varied the instruments could 
be interconnected. In each of the stages of the system 
integration, particular classes of errors and failures can 
be uncovered. y During element tests, electronic part 
failures and mechanical errors are discovered and cor- 
rected. After element testing, each element is considered 
operative and the troubles found in unit tests cannot be 
attributed to the elements. During unit testing, logical 
and timing design errors can be uncovered and intra-unit 
connections are ascertained to be correct. At the com- 
pletion of the unit test, each unit is considered to be 
completely operative. Therefore, during the component 
test phase, any difficulties discovered cannot be at- 
tributed to the unit, but rather to logical tie-in errors 
between units and inter-unit wiring. Similarly, the prob- 
lems within the subsystem test are related to only those 
difficulties encountered in integrating more than one 
component because of the completeness of the compo- 
nent evaluation. System testing is merely an extension of 
the previous statements, but now referring to problems 
encountered in integrating subsystems. The sequential 
building of test complexity gives us the advantage of 
solving small problems first before becoming involved 
with the intricacies and troubles inherent in any large 
system integration. 

Finally, the experience of the personnel involved in 
the test build-up enables a better understanding of the 
system operation, thereby decreasing the time required 
to integrate a large system made up of many discrete 
and special components. 
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Automatic Failure Recovery in a Digital 
Data-Processing System 

R. H. DOYLE, f R. A. MEYER, f and R. P. PEDOWITZf 


Introduction 

P ERFECT reliability in digital computers has not 
yet been achieved by simply designing ruggedness 
into the equipment components. Nevertheless, it 
is essential for a computer to perform dependably under 
all conditions. In certain computer applications, errors 
resulting in unscheduled maintenance delays can be tol- 
erated, but only at the cost of expensive computer time. 
In some special military and civil applications, such as 
the SAGE system and air-traffic control systems, poor 
equipment reliability can be disastrous since input in- 
formation not processed when the system is inoperative 
can become obsolete during the time required for man- 
ual recovery. 

Although it is virtually impossible to guarantee that 
failures will never occur, it is possible to maintain high 
over-all reliability of the system by immediately recov- 
ering from these failures with a negligible loss of opera- 
tional time. 

The FIX program was designed to effect automatic 
recovery from failures either by 

1) reinitiating the operation that failed, 

2) preventing the operational program from process- 
ing incorrect data, or 

3) determining the effect that a particular failure 
would have on a word of information and then 
modifying the information to compensate for this 
failure. 

The error-detection circuitry of the computer is relied 
upon to indicate the existence of an error in computer 
operations. When an error is detected by this equip- 
ment, the FIX program will be operated in an attempt 
to diagnose the failure and to compensate for it. 

Although FIX was specifically designed to work with 
the Air Defense Program of the SAGE Computer, the 
technique employed may be modified for other opera- 
tional or production systems. 

Several other methods for maintaining system relia- 
bility have already been developed. Some of these meth- 
ods will be briefly outlined in the preliminary section of 
this paper, followed by a detailed description of the 
structure and operation of the FIX program. 

Reliability Techniques 

In a complex computer system, component quality 
standards are necessary but cannot in themselves insure 

f IBM Corp., Kingston, N. Y. 


complete reliability. To approach the goal of high reli- 
ability, a more sophisticated viewpoint has been taken 
in designing both the equipment and the computer pro- 
grams. 

In the SAGE system, for example, the complete cen- 
tral computer has been duplexed, and the two computers 
alternately performed the operational program and a 
standby program on a 24-hour schedule. Special alarm 
circuits provide for alerting the standby computer when 
the active computer breaks down so that the standby 
machine will prepare to assume the active role. A por- 
tion of the standby-computer time is devoted to at- 
tempting to predict potential failure conditions before 
they occur. This technique, known as “marginal check- 
ing,” consists in operating and testing various circuits 
while an abnormal voltage is supplied to them. In this 
simulated aging of the equipment, the potential failure 
spots are anticipated. 

Modern computing equipment is usually designed 
with built-in circuitry 1 that will automatically detect 
the majority of errors that occur during system opera- 
tion. Many operational programs are written to take 
advantage of this circuitry by including alarm-interro- 
gation routines which will automatically repeat any 
operation that generated an alarm. 

Error-checking routines have also been incorporated 
directly into operational programs. 2 In programs where 
it is necessary to store blocks of information on auxiliary 
storage drums or tapes before reusing it, the accuracy of 
the transferred information may be checked by com- 
paring the arithmetic sum of the block before it is 
stored to a similar sum obtained after the block is 
brought back from storage. If the two check sums are 
not equal, the reliability of the information block can- 
not be depended upon and the program should be re- 
run. If the program is of considerable length, this task 
may be shortened by periodically saving the environ- 
ment of the program as it operates. This will provide a 
convenient recovery point should it be necessary to re- 
generate a particular block of information. 

Elaborate equipment and coding systems, such as the 
Hamming Code, 8 can provide for automatic self-correc- 
tion of errors and for detection of multiple errors. This 

1 C. J. Swift, “Machine features for a more automatic system for 
digital computers,” J. Assoc. Comp. Mach., vol. 4, p. 172; April, 
1957. 

2 J. H. Brown, J. W. Carr, III, L. Boyd, and J. R. McReynolds, 
“Prevention of propagation of machine errors in long problems,” 
J. Assoc. Comp. Mach., vol. 3, p. 348; October, 1956. 

3 R. W. Hamming, “Error detecting and error correcting codes,” 
Bell Sys. Tech. J., vol. 29, p. 60; 1950. 
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is accomplished by dividing the information to be 
checked into groups of bits and by parity-checking each 
group. The groups of bits are chosen in a manner such 
that an error in any bit in the entire word will generate 
alarm indications for a unique combination of these 
groups. Conversely, incorrect parity counts for any 
combination of these groups will uniquely identify the 
erroneous bit in the word. Since the incorrect bit can be 
identified, circuitry can be provided to correct the error. 
This ingenious coding system achieves excellent results, 
but only at considerable expense. Channel capacity of 
the equipment must be increased to provide for enough 
checking bits to represent a number equal to the total 
number of information bits plus the checking bits. 

Although FIX incorporates some of the techniques 
described above, the distinguishing feature of this pro- 
gram is that it achieves automatic failure recovery by 
means of programming techniques after an error has 
been detected by machine circuitry. While variations of 
the FIX concept will be necessary for other operational 
systems, depending upon the error-detection circuitry 
of the computer and upon the form of the operational 
program, this paper will serve to illustrate the general 
principles of the FIX technique. 

Errors in a computer can occur either during the 
actual processing of data, such as sorting, collating, 
arithmetic computations, etc., or during the transfer of 
information between the central computer and the vari- 
ous auxiliary drum storage units. Since the Air Defense 
Program requires a large storage area, it is stored on 
auxiliary drums, and a considerable number of informa- 
tion transfers continually occur during normal opera- 
tions as the various subprograms and their data tables 
are brought into core memory to be operated. It is ex- 
tremely important that these transfers be performed 
correctly; hence a large portion of this paper discusses 
the technique of monitoring and .correcting errors in 
such transfers. 

Errors incurred during either central computer or 
transfer operations may be either transient or “solid” in 
nature. Errors which are due to high stresses of voltage, 
temperature, shock, etc., and which have a low prob- 
ability of recurring, will be referred to as “transient er- 
rors.” Those errors which are a result of a persistent 
equipment malfunction and which can continually be 
expected to reappear whenever the submarginal area of 
equipment is used, will be referred to as “solid errors.” 
The FIX program has achieved a high degree of success 
in automatically recovering from most of the classes of 
errors described above. 

Program Design 

Storage requirements for the present version of the 
FIX program are 50 core-memory registers and 5000 
auxiliary-drum-storage registers. This represents ap- 
proximately 3 per cent of the total storage available in 
the SAGE computer. During any alarm condition, the 
short FIX routine that is permanently stored in core 


memory will save a portion of the operational program 
in order to provide a working area for the main section 
of FIX. This same routine will then read the appropri- 
ate diagnostic FIX routine into core memory. 

Fig. 1 is a flow chart of the logical structure of the FIX 
program. This structure may be analyzed in terms of 
four functions: 

1) Alarm monitoring and control 

2) Diagnosis 

3) Logging 

4) Recovery. 

These functions are closely related and, although the 
above list represents the over-all time sequence of the 
operations to be performed, there will be considerable 
overlapping in the detailed structure. Since the design of 
the FIX program is a function of the makeup of the 
operational program and of the system to be monitored, 
some of the features of the SAGE system, including the 
error-detection equipment of the computer and the 
structure of the Air Defense Program, will be discussed 
during the analysis of the FIX program. 

Alarm Monitoring and Control 

The operation of the FIX program is greatly de- 
pendent upon the means by which FIX can be notified 
of an error occurring in the monitored system. In the 
SAGE computer, this is provided for by error-check- 
ing and alarm-control circuitry. 

Self-checking is performed by the use of parity-code 
generation and checking circuits that determine if the 
correct number of bits in a binary word have been trans- 
ferred from register to register during the normal data- 
processing operation. This is accomplished by increas- 
ing channel capacity to allow for one redundancy bit to 
be contained in the information transferred. As each 
instruction or data word is stored in the computer, it 
passes through a buffer register, which counts the num- 
ber of “one” bits in the word. The parity bit associated 
with each word will be set to a “one” or a “zero” to give 
an odd number of “one” bits in the word, including the 
parity bit. 

The parity bit will then be stored with the rest of the 
word. When this instruction or data word is referred to 
by the program, a parity-check count is again per- 
formed in the buffer register as the word is brought out 
of storage. If the total parity count is not still odd at this 
time, the word is presumed to be incorrect and a parity 
alarm will be generated. If no error is detected, the op- 
eration will continue and a new parity assignment will 
be performed prior to storing the word after it has been 
operated upon. The parity circuitry is used to check the 
correctness of all data transfers that occur in the sys- 
tem. It should be noted that a major shortcoming of the 
parity-checking system is that if two bits in the word 
are altered as the result of some failure, the odd parity 
count will not be disturbed and the error will not be 
detected by the parity circuitry. Such an error might 
remain unnoticed, in which case the final result would 
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Fig. 1 — Flow chart of the logical structure of the FIX program. 
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be incorrect, or it might result in other error indications 
which could be detected. 

Automatic detection of other abnormal conditions in 
the SAGE computer is also provided by circuitry. 
Sometimes because of circuit failure or an undetected 
parity error, or because of a peculiar set of environ- 
mental circumstances unanticipated by the program 
designer, the computer can begin a nonterminating 
cycle of meaningless operations, commonly referred to 
as an “illegal loop.” Similarly, the computer might begin 
an inactive period during which it does nothing but wait 
for some anticipated event. If for some reason the event 
can never occur, the computer will remain in this inac- 
tive condition indefinitely. Special circuitry designed to 
impose time limits on such conditions can, upon sensing 
an illegal delay in computer operations, terminate the 
condition and by means of an inactivity alarm indicate 
to the computer that the delay existed. The inactivity 
alarm will be activated if a pulse is not generated by the 
program at regular intervals or if too many of these 
pulses are generated within a given time period, usually 
about eight seconds. The programmer must, therefore, 
insert the pulse-generating instruction at regular inter- 
vals throughout his program if he intends to use this 
circuitry. If the program operates normally, the pulses 
will be generated at regular intervals. If the program is 
“illegally” delayed in a routine, or if it continuously 
loops through a few instructions, either too few or too 
many signals will be generated, and the inactivity alarm 
activated. 

Finally, in certain instances an error in a series of 
arithmetic operations may result in the attempted de- 
velopment of a sum or quotient which has increased 
in size beyond the physical limits imposed by the 
register capacity of the computer. This condition, too, 
can be sensed by machine circuitry in the SAGE com- 
puter and indicated by means of an overflow alarm. 

The programmer can choose various modes of opera- 
tion by using switch settings when planning the reac- 
tions of the SAGE computer to these alarm conditions. 
These options can be set to have the computer auto- 
matically 

1) stop on alarms, 

2) branch on alarms, or 

3) continue on alarms. 

Under option 3 the program retains the ability to 
interrogate the alarms at some convenient time before 
taking any automatic action. 

The mode of operation used by the FIX program was 
determined by the nature of the errors that would be 
encountered. Certain types of computer malfunctions 
demand immediate transfer of control to the FIX pro- 
gram. For example, if there is a parity alarm when the 
computer refers to its internal memory for a new in- 
struction step or operand, further operational steps 
would be useless and might even destroy information. 
An inactivity alarm, too, will cause an immediate trans- 


fer, since to continue in this case means to continue the 
abnormal function. The overflow alarm can also cause 
an automatic branch to FIX, but this feature is de- 
signed so that the alarm may be suppressed in the opera- 
tional program when it is known that overflows may 
occur during normal operation. 

An automatic transfer of control to FIX is effected 
by setting the core memory parity, inactivity, and over- 
flow alarm switches in the “active” position and the 
stop-branch switch in the “branch” position. 

Transfers of data between core memory and mag- 
netic drums may be monitored in another manner. 
Erroneous information that might be included in such 
a transfer cannot adversely affect the computer until 
used. Therefore, a drum parity alarm need not cause 
an immediate branch to the FIX program. Instead, the 
drum-parity-alarm switch is set to continue on alarm. 
At the conclusion of every block transfer, FIX checks 
the drum-parity-alarm indicator by means of a pro- 
gram instruction. Since the Air Defense Program was 
designed so that all transfers are controlled in one sec- 
tion of the program, the insertion of one interrogation 
instruction is the only modification of the operational 
program necessary to enable FIX to perform its entire 
monitoring function. If the alarm indicator is sensed in- 
active, there is no change in the normal sequence of 
events in the operational program. If at the end of a 
block transfer of data into core memory the appropriate 
alarm indicator is tested and found to be active, a pro- 
grammed branch to the drum recovery section of FIX 
is effected. 

Diagnosis 

At this point FIX will attempt to perform all diag- 
nostic work necessary for recovery. Where time per- 
mits, FIX will also perform several diagnostic opera- 
tions that are desirable for corrective maintenance 
studies. In all cases the results will be saved for logging 
and, depending on the circumstances, they may also 
be displayed immediately. 

In the event of any type of alarm, initial FIX action 
would save the contents of all the computer registers, 
such as the accumulator, index registers, buffer register, 
program counter, etc., as they existed at the time of the 
error. This information is used as an aid to diagnosis, as 
part of the record of the error for maintenance purposes, 
and also to enable FIX to restore the environment of 
the Air Defense Program prior to effecting a recovery. 
The type of error, such as would be indicated by a drum 
parity alarm or memory parity alarm, will be deter- 
mined by considering the mode of entry to the FIX pro- 
gram and by sensing the various alarm indicators. 

If there has been a memory parity alarm, FIX will 
refer to the program counter setting as it was at the 
time of the error and will locate the incorrect informa- 
tion that was being operated upon at that time. 

When the erroneous information is referred to a sec- 
ond time, a second memory parity alarm may or may 
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not be generated. Considering the case where a second 
parity alarm is not generated, FIX will continue its 
diagnosis by comparing this instruction or operand to 
the word that was parity-checked in the buffer register 
at the time of the error. If the contents of the buffer 
register match either the instruction or data word, FIX 
concludes that there was a false parity error, i.e., an 
error in the parity-checking circuitry itself, and that 
the operation was in fact completed correctly. 

If upon comparing the buffer register to the memory 
register FIX finds that the buffer register was com- 
pletely zero at the time of the error, this would indicate 
that the alarm was probably due to a failure to get a 
start memory pulse and that no operation had begun 
when the alarm was generated. 

Finally, a condition may arise where the buffer regis- 
ter is neither all zero nor equal to the instruction or data 
word in memory that supposedly generated the alarm. 
This would indicate a memory readout failure and an in- 
correctly completed operation. If a second parity alarm 
is generated on the second reference to the instruction or 
operand in core memory, the error is considered genu- 
ine, and, once again, the operation could not have been 
completed correctly. 

The results of the investigation of each memory 
parity error are included in a record for maintenance 
purposes and will also serve as a guide to proper recov- 
ery action. No diagnostic action is taken in the event of 
an inactivity or overflow alarm other than saving the 
contents of the computer registers, and recording the 
type of alarm and the alarm exit location for the main- 
tenance records. 

Errors incurred during the transfer of information 
from the magnetic drums to the central data-processing 
unit are treated according to the class of drum involved. 

Input status drums represent the supply of new in- 
formation to the computer from an external source, such 
as a radar site. Under ordinary circumstances input data 
cannot be stored on a status drum by a program, nor is 
it possible to transfer the same information from a status 
drum to the central computer more than once. Conse- 
quently, a status drum is not normally available for 
complete testing and diagnosing by FIX without undue 
delay of the operational program. 

Recovery from status-drum errors will vary according 
to whether the failure was transient or solid. Therefore, 
when a status-drum error is detected, FIX will examine 
the block of transferred information in core memory 
and, on the basis of the number of errors found, classify 
the failure as transient or solid. An erroneous status- 
drum transfer is classified as a solid failure if the number 
of errors contained in that transfer is more than five (an 
arbitrary figure). Further diagnosis for recovery and 
maintenance consists in determining the identity of the 
failing drum-input channels and the total number and 
frequency of similar errors. 

Addressable drums serve as an auxiliary information- 
storage area. All data transfers to and from addressable 


drums are performed under program control. Address- 
able drums are therefore readily available for diagnosing 
by the FIX program. 

Upon noting an erroneous transfer of this type, FIX 
will search the block of transferred information in core 
memory for the information in error. The original ver- 
sion of data transferred incorrectly will remain un- 
changed on the drum until it is deliberately replaced 
with new data. For this reason, when an incorrect word 
is found, FIX will locate the original information on the 
drum and repeat the transfer of the incorrect word to 
determine whether the error was transient or solid. If 
the second attempt succeeds, a correct version of the 
word is now properly transferred, and by a comparison 
of the correct and incorrect information, the exact cause 
of the failure may be determined and saved for logging. 
This process is repeated if a second word in the transfer 
is in error. 

The timing requirements of the operational program 
will not permit the luxury of individually treating more 
than two such words solely for maintenance purposes. 
If more than two words in a given transfer are found 
to have been in error, the remainder of the erroneous 
transfer is repeated at once, sacrificing additional diag- 
nostic information for increased speed in recovery. 

If any one of the recovery transfers is not successful 
on the second attempt, the error is considered to be of a 
solid nature. Further diagnosis is necessary, but recov- 
ery can still be achieved. Test data of known structure 
may be transferred over the same channels and checked 
by return transfer. Since the failure is solid, these trans- 
fers will also fail, but this time the exact nature of the 
failure can be determined. FIX maintains a history of 
results obtained in this way in a Learning Table. This 
table is used by FIX to compensate for future errors and 
also serves as a guide for corrective maintenance. 

Fig. 2(a) represents an abbreviated word, consisting 
of five bits plus a parity bit, which FIX has determined 
had been incorrectly transferred into core memory from 
an addressable drum. The total number of “one” bits, 
including the parity bit, must be odd in order to be cor- 
rect. The parity alarm which identified this word for 
FIX was a result of a check which indicated only that 
an even number of “one” bits was transferred in this 
word. Therefore, further diagnosis is necessary to deter- 
mine which bit has been modified in transit. 

Fig. 2(b) illustrates the standard method of testing 
the transfer channels to and from an addressable drum. 
By using a pattern of all “ones” and then of all “zeros,” 
the channels may be tested for evidence of bit modifica- 
tion. This method, however, precludes the possibility 
that a unique pattern of bits in a word contributed to the 
failure of one particular channel. Investigation has 
established that failures may sometimes be uniquely 
associated with one word pattern and not with another. 
In the critical circumstances under which FIX is acti- 
vated, it is felt that the extreme importance of being ac- 
curate has justified using a more detailed testing pro- 



164 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 



TEST PATTERN (1 '* or 0'*) 
WRITTEN ONTO DRUM. 

ALL BITS ARE BEING TESTED 


PARITY BIT IS SET TO 
ZERO BY COMPUTER 
PRIOR TO TRANSFER 



TEST WORD READ INTO CORE 
MEMORY TO TEST FOR FAILURE. 
BIT D HAS FAILED TO RETURN 
CORRECTLY 


PARITY BIT STILL' ZERO 
RESULTS Itf EVEN NUMBER OF 
ONE BITS, INDICATING 
WORD INCORRECT 



(b) 


MODIFIED ORIGINAL WORD 
WRITTEN ONTO DRUM. 

BIT D IS BEING TESTED 


CORRECT PARITY COUNT 
ASSIGNED BY COMPUTER 
PRIOR TO TRANSFER 



TEST WORD READ INTO CORE 
MEMORY TO TEST FOR FAILURE. 
BIT D HAS FAILED TO RETURN 
CORRECTLY 


PARITY BIT STILL ONE. 
RESULTS IN EVEN NUMBER OF 
ONE BITS INDICATING WORD 
INCORRECT 



(C) 

Fig. 2 — (a) Even parity count indicates word incorrectly transferred but does not indicate which bit failed, (b) Standard test pattern'tech- 
nique for checking transfer channels comparing “before” and “after” words discloses the discrepancy, (c) Fix technique using original 
word [see 2(a) above] to check one channel at a time. The bit discrepancy is double checked by “before” and “after” comparison. 
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cedure than the common test pattern. FIX checks the 
transfer channels by using the original pattern of bits in 
the incorrect word as nearly as possible. 

When a solid failure is detected, FIX first checks the 
Learning Table for a history of solid errors on this par- 
ticular drum. If such records exist, FIX checks each 
transfer channel indicated by the Learning Table as 
having failed before. Fig. 2(c) illustrates a channel being 
tested in this manner. A bit which was suspected of hav- 
ing been lost in transit is changed to a “one” in core 
memory, and the entire word is then transferred to and 
from the drum to test this channel. If this bit fails to re- 
turn as it was sent out— in this case as a “one” — this 
discrepancy is recorded. The Learning Table search will 
usually take no more than about 50 msec. If only one 
bit is found to be erroneous in this word at the comple- 
tion of the Learning Table test, the results will be used 
to effect a recovery. If the Learning Table examination 
is not fruitful, recovery may still be achieved by further 
diagnosis. 

In this event the next step would be for FIX to con- 
duct a complete examination of the word. This is basi- 
cally the same as the Learning Table test, except that the 
entire word is tested for evidence of failure instead of 
only those channels indicated by the Table. Each bit in 
succession is complemented, transferred to and from 
the drum along with the rest of the word, tested for 
evidence of modification in transit, and restored to its 
original state. 

Any bit that fails to return in the same form as it 
was sent out is recorded. The bit-by-bit findings are 
accumulated until the end of the examination, which 
takes about one second. If the complete examination 
discloses that only one bit has failed in this word, the 
results will be used to effect a recovery. 

Logging 

All information gathered by the FIX program will be 
either printed on the teletype monitor, displayed imme- 
diately, or recorded in the Learning Table. The recov- 
ery that will follow is meant to improve the system reli- 
ability, not to shield equipment failures. If failures were 
not logged when recovery was achieved, the equipment 
could deteriorate with age until, without warning, 
catastrophic failure occurred. 

During any alarm condition where automatic trans- 
fer occurs, FIX saves the current contents of all the 
computer registers for logging. The various alarm indi- 
cators will be tested to determine which type of alarm 
occurred. This information, together with the identity 
of the operational routine interrupted by the alarm, the 
data that was being processed at that time, and the de- 
tails of the error as diagnosed will be logged on the tele- 
type printer immediately after the error occurs. A rec- 
ord of the number of such errors will also be maintained 
on a display. 


Status- and addressable-drum errors are internally re- 
corded in the Learning Table and are also displayed as 
they occur. Each time a status-drum error occurs, the 
drum field in error and its input channel are recorded 
and displayed together with the total number of such 
errors recorded up to this time. The record and display 
for addressable-drum errors includes the drum field, 
failing transfer channel and the nature of the failure, 
i.e., whether the erroneous bits were “ones” or “zeros,” 
whether the failures were solid or transient, and the 
total number of such errors. 

Enough pertinent information concerning each fail- 
ure incident is logged to permit maintenance study 
teams to attempt to duplicate the trouble and keep de- 
tailed statistics on the reliability of the circuits in the 
system. Maintenance personnel can resolve machine 
difficulties only if this kind of logging is done. As experi- 
ence is gained, the difficulties will recur less frequently, 
since equipment and program design improvements 
will be suggested by the statistics. 

Recovery 

The function of the recovery sections of FIX is to 
perform all operations necessary to restore control of 
the computer to the operational program. The choice 
of the method depends on the following factors: 

1) The nature of the interrupting malfunction 

2) The results of the diagnosis 

3) The time spent in detecting and diagnosing 

4) The number and frequency of this type of failure 
incident. 

Some failure incidents in the central computer may 
be rectified by restoring the contents of the computer 
registers and internal memory to their original values, 
as saved by the alarm monitoring and control sections 
of FIX, and then transferring computer control back to 
the operational program at the point of interruption. 
This recovery method is most efficient, requiring up to 
about 30 msec, and is used, wherever possible, in the 
case of memory parity errors. 

If diagnosis indicates that the operation was com- 
pleted correctly, as in the case of a false parity error, the 
environment of the Air Defense Program will be re- 
stored, and recovery will be effected by reinitiating 
operations with the next program step following the one 
that operated at the time of the error. The exception to 
this would be if the interrupted instruction involved a 
transfer operation, in which case program control would 
be returned to the same transfer instruction. 

If the instruction was never completed correctly, as 
in the case of a failure to get a start memory pulse or a 
memory readout failure, recovery will be attempted by 
reinitiating the Air Defense Program with the instruc- 
tion that originally failed. 
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When an error is diagnosed as a genuine memory- 
parity error, the erroneous word must be corrected in 
core memory before the operational program can be 
resumed. At the present time FIX cannot automatically 
regenerate a correct version of a faulty word in core 
memory. However, since all of the program instruc- 
tions and most of the data that are used by the Air De- 
fense Program are stored on magnetic drums, FIX will 
attempt to locate a correct copy of a word in auxiliary 
storage and substitute it for an erroneous word in core 
memory. Whenever this is possible, recovery will then 
be achieved by reinitiating the Air Defense Program 
with the correct version of the instruction that orig- 
inally failed. 

If any of these methods are not successful in achieving 
recovery, i.e., if the identical failure is immediately en- 
countered, or if these methods are not feasible, as in the 
case of inactivity or overflow errors, an alternate method 
of recovery is available. 

The Air Defense Program consists of a series of sub- 
programs, each of which operates in its turn upon the 
latest input data fed to it. The entire process is an 
iterative one. After the last program has been com- 
pleted, the first program will be called on to repeat its 
function upon the latest available data. 

When an operation cannot be resumed at the point 
of interruption, recovery can often be achieved by re- 
initiating the Air Defense Program at the beginning of 
a cycle or frame of operations so that completely new 
input data may be processed. The startover procedure 
takes approximately five seconds. Thus, an abnormal 
condition which was the result of a transient failure will 
not degrade the performance of the system. A more 
serious or solid failure in the central computer will of 
necessity cause repeated restarts of the operational pro- 
gram. It is left to the discretion of the operator to im- 
pose limits on the number or frequency of the attempts 
to recover in this manner. 

FIX does not distinguish between central computer 
errors that occur while the Air Defense Program is op- 
erating and those that occur during FIX operation. If a 
second error is encountered while one error is being cor- 
rected, the later error will take precedence. FIX will at- 
tempt to correct a central computer error within itself 
in exactly the same manner as an error occurring within 
the Air Defense Program. The original error, however, 
will then be disregarded, recovery of the Air Defense 
Program being achieved via startover. Of course a seri- 
ous error in a vital section of FIX will preclude auto- 
matic recovery, and manual intervention will be neces- 
sary. 

Recovery from transient status-drum errors is 
achieved in another manner. In the event of a status- 
drum failure, the testing procedure is limited by the 
fact that there is no practical way to make an experi- 
mental transfer between the central computer and a 


status drum without a prolonged interruption of the Air 
Defense Program. Since FIX cannot determine exactly 
what failed in this type of transfer, it is not possible to 
estimate what the data should have been. 

The solution is to render the erroneous data harmless 
by eliminating them from core memory, adjusting the 
Air Defense Program’s records to compensate for the de- 
crease in the amount of input information to be proc- 
essed, and then returning to the logical operation in the 
main program that would normally follow the status- 
drum transfer. The momentary loss of some input data 
to the computer from a radar site, for example, will 
have no more effect on the Air Defense Program than 
would the slight interruption of radar fixes that are ex- 
pected to occur during normal operation. Such tempo- 
rary losses, or “miss fixes,” as they are commonly called, 
are not unusual and the Air Defense Program provides 
for this by extrapolating or filling in for missing radar 
fixes when they occur. In this manner an accurate plot 
of the velocity of a hostile ship can easily be maintained 
despite the fact that a few positional fixes are missing, 
if the available fixes are dependably correct. A much 
smaller number of incorrect fixes can destroy the ac- 
curacy of a course plot if no means is provided to prevent 
these fixes from being included in the plotting computa- 
tions and therefore the elimination of incorrect input 
data is much more desirable than treating such informa- 
tion as valid. 

FIX cannot allow the situation to continue where a 
large number of consecutive errors reduces the flow of 
information to the Air Defense Program below an ac- 
ceptable minimum. An excessive number of errors in one 
status-drum transfer will require that recovery be at- 
tempted by reinitiating the Air Defense Program at the 
beginning of a new frame of operations so that new 
input data can be called for and processed. 

A dynamic display of all facts which are pertinent to 
this type of error is up-dated each time an error occurs. 
By observing the display, maintenance men may be 
able to determine the input channel from which most of 
the errors are coming. They may thus be able to elimi- 
nate or reduce the quantity of status-drum errors by 
substituting a spare input channel without interrupting 
the operational program. 

Recovery from addressable-drum failures may be 
achieved by modifying a bit in the incorrect word or 
words in core memory according to the nature of the 
failure. Transient errors are corrected during the diag- 
nosing operations. 

In a solid failure during the transfer of a block of in- 
formation to core memory from an addressable drum, 
many words may be expected to have transferred in- 
correctly. In an operational period, time does not permit 
that FIX be allowed to diagnose each word before cor- 
recting it. When FIX is satisfied that the Learning 
Table test or the complete examination has disclosed 
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the failing transfer line for one word, it will use this in- 
formation to correct this word and the remaining words 
in error in the same transfer. In Fig. 3(a), let us say that 
Words 1-10 were incorrect in the block of transferred 
information shown. Suppose that a complete test of 
Word 1 indicated that channel A had failed, and that 
the bit in position A should have been a “one.” After 
correcting Word 1 in core memory as shown in Fig. 
3(b), FIX would continue to examine the remaining in- 
correct words. If a check of bit position A in each in- 
correct word indicates that it was possible that the iden- 
tical error occurred in each incorrect word, these words 
would also be corrected in the same manner as Word 1. 
[See Fig. 3(b), Words 2-7.] 

In an actual transfer, several thousand words might 
be involved. If a solid failure occurred, the number of 
words in error could be expected to be quite large. Con- 
sequently, if the error initially found in the complete 
test did not apply to all of the incorrect words in a 
transfer, it is felt that this fact would soon be obvious. 
Continued examination of the remainder of the incor- 
rect words should reveal at least some words which could 
not have failed in the same manner. In Fig. 3(a), Words 
8 and 10 now contain a “one” in bit position A. There- 
fore, that bit could not have dropped during the original 
transfer. This would indicate that another channel had 
failed in transferring these words and might also have 
failed during the transfer of any of the previously “cor- 
rected” words. This inconsistency would invalidate the 
“corrections” made earlier to these transferred data. In 
such a situation, recovery is effected by restarting the 
operational program at the beginning of a frame so 
that new input data may be processed. If more than one 
channel is found to have failed, this information will be 
immediately logged prior to initiating a startover of the 
Air Defense Program. The maintenance men may then 
determine whether or not it will be possible to permit 
the Air Defense Program to continue to process new in- 
put information. 

The main section of the FIX program is also stored on 
an addressable drum. If an error is encountered in read- 
ing FIX into core memory, the diagnostic and repair 
sections cannot be used to correct this error. Instead, 
the permanent FIX routine in core memory will initiate 
a startover of the Air Defense Program. If this tech- 
nique is unsuccessful for recovering from an error, man- 
ual intervention will be necessary. 

Results 

After a six-month study of failures during 87 missions 
of the Air Defense Program of the SAGE computer, it 
was determined that the causes for failures were dis- 
tributed as follows: 

Drum parity (status and addressable) S3 per cent 

Memory parity 10 per cent 

Inactivity 21 per cent 

Miscellaneous (power failures, stopped by operator, etc.) 16 per cent 
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Fig. 3 — -(a) A portion of a block of data transferred into core memory 
from an addressable drum. Words 1-10 were incorrectly trans- 
ferred. (Note the even parity count in each erroneous word). An 
examination of Word 1 indicated that bit A had dropped in 
transit. This bit would be corrected in Word 1 and all other in- 
correct words which could have failed in the same manner, 
(b) Bit position A has been corrected in Words 1-7. The incon- 
sistency in Word 8 prevented further attempts at recovery in this 
transfer. 

On the basis of these results, FIX theoretically is capable 
of automatic recovery from 84 per cent of all failures 
occurring in the period studied. 

Shortly after this study FIX was employed for an ex- 
tended number of evaluation missions on the SAGE 
computer. 

During this large trial period of computer operation, 
FIX provided automatic recovery from more than 92 
per cent of the failures. Of the remaining errors, only 
about 2 per cent require unscheduled maintenance, the 
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TABLE I 

Summary of FIX Action for Each Type of Error 


Type of Error 


Recovery Procedure 


False memory parity: within Air Defense Program and/or FIX. 

Fail to get start memory pulse, or other memory readout failure: 

in Air Defense Program and/or FIX. 

Genuine memory parity: in Air Defense Program and/or FIX. 


Inactivity, overflow, genuine memory parity that cannot be cor- 
rected, solid memory parity : within Air Defense Program and/or 
FIX. Solid status-drum failure (an excessive number of errors 
in one transfer) in Air Defense Program only. Drum parities 
while bringing in FIX: FIX only. 

Transient status-drum parity: in Air Defense Program only. 


Transient addressable-drum parity: in Air Defense Program only. 
Solid addressable-drum parity : in Air Defense Program only. 


Any catastrophic error from which FIX does not successfully re- 
cover. 

Errors which do not result in memory parity, drum parity, inactiv- 
ity or overflow alarms. 


Reinitiate program with next instruction following the one that oper- 
ated at the time of the alarm. 

Reinitiate program by repeating instruction that operated at the time 
of the alarm. , 

Replace incorrect word with good copy from auxiliary storage drum 
and reinitiate program by repeating instruction that operated at the 
time of the alarm. 

Reinitiate Air Defense Program at the beginning of a new frame of oper- 
ation. FIX imposes no limits on the number of “startovers” that 
may be initiated in attempting to recover. The operator must deter- 
mine if an excessive number of restarts is cause for manual interven- 
tion. 

Eliminate erroneous information from core memory, adjust records of 
Air Defense Program and continue operations at point following 
transfer. 

Repeat transfer. Re-enter Air Defense Program at point following 
transfer operation. 

Correct erroneous words in core memory. Restart Air Defense Program 
at point following transfer. If diagnosis is inconclusive for purposes 
of correcting error, restart the Air Defense Program at the beginning 
of a new frame of operation. 

Manual intervention. 

None; FIX not activated except by the alarm circuitry of the computer. 


other 6 per cent being due to operator and program 
errors. 

It appears that the inclusion of FIX in the tests in- 
creased computer efficiency. In addition to the improve- 
ment in system performance achieved with FIX, a long- 
term gain should be realized in basic equipment reliabil- 
ity and useful operational time because of the increased 
accuracy of the maintenance information supplied by 
FIX. 

In summary, the advantages offered by FIX are these: 
Recovery from most errors is accomplished auto- 
matically, almost immediately, . and accurately, thus 
assuring the correct results with a negligible amount of 
lost operational time. Table I is a summary of FIX ac- 
tion for each type of error. 

The Learning Table record permits more immediate 
correction of some errors the second time they occur. 


Solid errors in addressable-drum transfers, for example, 
have been virtually eliminated as a source of reduced 
computer efficiency. 

The logging feature of FIX affords a detailed record of 
all errors exactly as they occurred as an improved aid 
to corrective maintenance. 

Although the advantages and results obtained so far 
have been limited to one specific application in the 
SAGE system, it is felt that variations of the FIX 
concept can be successfully applied to other operational 
and production programs written for a data-processing 
system. 
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A High-Speed Data Translator for Computer 
Simulation of Speech and Television Devices 

E. E. DAVID, jE.f, M. V. MATHEWSf, and H. S. McDONALDf 


Introduction 

T HIS PAPER describes a data translator which, 
when used with a digital computer, permits simu- 
lation of speech- and television-processing devices. 
The usual function of such devices is to realize ef- 
ficient codings of information-bearing signals thereby 
economizing on transmission requirements. The merit 
of any particular coding must be based upon human 
judgment. Such subjective reactions can seldom be pre- 
dicted by analytical means. For example, the evaluation 
of speech-transmission systems relies heavily upon 
quality and intelligibility tests using human observers. 
Thus, instrumentation of operating models to generate 
test speech samples invariably becomes a part of any 
such evaluation. In addition to transmission codings, 
signal-processing devices have been used to investigate 
automatic recognition of abstractions such as phonetic 
elements in speech or geometrical figures. Such recog- 
nition functions typically require even more extensive 
instrumentation. 

Models for studying coding and recognition schemes 
currently being considered are complicated, involve 
great logical complexity, and require large flexible 
memories. The cost of construction is high not only in 
money, but in the time and energy of the researcher. 
Several years are often involved in adequately testing 
relatively simple notions, and systematic evaluation of 
some well-founded proposals has been delayed more 
than 20 years. Clearly, the need for extensive instru- 
mentation is a bottleneck limiting creativity in this 
area. 

It is now possible for most laboratory purposes to 
eliminate the construction of much complicated equip- 
ment by means of simulation with general-purpose 
digital computers. A computer can be made to act the 
part of any specified device for the duration of an ex- 
periment and requires only a changed program to 
transform its action to simulate a new device. Not only 
are savings in money and time realized, but many other 
equally important advantages can be achieved. For 
instance, the range of ideas which can be considered is 
greatly expanded, the simulated device and the data can 
be precisely controlled, and device parameters can be 
easily and flexibly modified. 

Simulation of speech and television signal processing 
devices requires an input and output translator capable 
of delivering such signals into the computer and recover- 

f Bell Telephone Labs. Inc., Murray Hill, N. J. 


ing them from the computer without appreciable degra- 
dation. We have described in a previous paper 1 one at- 
tempt to satisfy this requirement. A block diagram of 
that translator is shown in Fig. 1 and we shall review its 
operation and limitations before describing a new high- 
speed version. The analog input is filtered to a band 
limit of W cps, and then sampled 2 W times per second. 
An analog-digital converter codes each sample into 11 
binary digits which appear in parallel at its output. Ten 
of these digits are monitored by the recording and play- 
back electronics whose function it is to arrange the digits 
in the proper format and record them on a magnetic 
tape medium. The initiation, cessation, and timing of 
these events are assured by the control unit. This trans- 
lator is bilateral, recording tapes for computer input, 
and decoding tapes written by the computer. Thus the 
processed computer output can be displayed audibly or 
visually for subjective evaluation. 

/" 


BAND LIMITED 
INPUT OR SAMPLED 
OUTPUT 



Fig. 1 — Original data translator. 


The operating parameters of the translator are set 
by the tape format, which of course is in turn deter- 
mined by the computer input requirements. Our trans- 
lator is designed to match the IBM 704 computer 
which calls for a seven-track nonreturn-to-zero record- 
ing with digits recorded simultaneously in each track. 
There need be 200 such seven-digit characters per inch 
on the tape. The translator operates in two modes. In 
mode I, each input sample is represented as a six-digit 
binary number (the remaining four digits are discarded) 
and is recorded in one tape character (only six places 
are available in each character since the seventh place is 
reserved for a parity check digit). In mode II, each input 
sample is represented as a 10-digit binary number and 

1 E. E. David, Jr., M. V. Mathews, and H. S. McDonald, “De- 
scription and results of experiments with speech using digital com- 
puter simulation,” 1958 WESCON Convention Record, pt. 7, pp. 
3-10. 
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is recorded in two tape characters. The five most sig- 
nificant digits, identified as such by a zero in the sixth 
place, occupy the first character, while the five least 
significant digits, identified by a one in the sixth place, 
occupy the second character. 

In either mode I or mode II, the character density 
on the tape together with the linear tape speed fixes the 
input sampling rate. For instance at a tape speed of 50 
inches/second, 50X200 = 10,000 characters per second 
are recorded. In mode II, this figure corresponds to 
5000 input samples/second. Three tape speeds are 
available on the tape transport, and the corresponding 
input sampling rates are summarized in Table I. These 
figures show both an upper and lower limit on the sam- 
pling rate. Experience with speech and television sim- 
ulation experiments have shown that these limits are 
inconvenient constraints 2 on the analog-signal band- 
width. For instance, unless the analog signal is subjected 
to a time-scale transformation before its introduction 
into the translator (such a transformation can be ac- 
complished by recording and reproducing the analog 
signal at different tape speeds), a 5000-cps speech wave 
can be represented to only six-digit accuracy. Even 
time-scale transformations do not alleviate these limits 
greatly, since record-reproduce speed transformations 
using analog recorders introduce their own signal degra- 
dations and thereby have restricted application. 


TABLE I 


Tape Speed 

Mode I 

Mode II 

Inches/Second 

Samples/Second 

Samples/Second 

12.5 

2500 

1250 

25 

5000 

2500 

50 

10,000 

5000 


The synchrony of the sampling and recording proc- 
esses in this translator not only places an upper and 
lower bound on the available sampling rate, but in addi- 
tion restricts that rate to a number of discrete values. 
This limitation has likewise proved to be an incon- 
venience limiting the flexibility of the translator. Two 
further shortcomings result from this same basic cause. 

First, for ease of programming signal-processing ex- 
periments, it is desirable to have the digital data 
divided into several fixed-length blocks called “records,” 
each separated by a gap in which nothing is recorded. 
The present translator has no provision for so punc- 
tuating the tape with “record gaps.” Recording must 
proceed continuously during the presence of an analog 
input if it is to be sampled without interruption. Such 
a procedure often results in long recordings which are 
inconvenient for the computer to handle. 

The second shortcoming is of even greater importance 
and arises from irregularities in character spacing on 

2 R. E. Graham and J. L. Kelly, Jr., “A computer simulation chain 
for research on picture coding,” 1958 WESCON Convention Rec- 
ord, pt. 4, pp. 41-46. 


output tapes written by the computer. These irregulari- 
ties are introduced by the low-inertia, fast start-stop 
tape transports associated with the computer. When 
such tapes are reproduced through the translator, these 
fluctuations appear as “flutter” and “wow” in the 
analog output. Though this effect has been minimized 
by careful selection of tape transports, it remains the 
most severe limit on the fidelity of reproduction. 

To recapitulate, all of the limitations imposed by the 
translator, namely: 1) upper and lower bounds on input 
sampling rate, 2) only discrete sampling rates available, 
3) data blocks on digital tape of arbitrary size, and 4) 
analog output having flutter and wow, result from the 
inherent synchrony between the input-output sampling 
rate and the recording-reproducing rate on the digital 
tape. The remainder of this paper describes the design 
and construction of a new translator which avoids these 
problems by divorcing the two rates. 

System Design of Data Translator 

The limitations in the existing translator have been 
overcome by incorporating a buffer storage between the 
analog-digital converter and the digital tape recorder. 
During recording, the buffer stores the samples of the 
input while the tape recorder is inserting record gaps. 
Thus convenient fixed-length records may be produced 
without interruption of input sampling. During play- 
back, the buffer has sufficient capacity to continue de- 
livering characters (to the digital-to-analog converter) 
during record gaps in the digital tape, thereby assuring 
an uninterrupted flow of output samples. In addition, 
the buffer can smooth the jitter in the flow of samples, 
producing a uniformly timed sequence to the digital-to- 
analog converter. 

A buffer of practical size can hold only a small amount 
of data compared to the amount of data passing through 
it in most applications. Consequently, to prevent com- 
pletely emptying or filling the buffer, the average rate 
of recording samples must be equal to the average rate 
of sampling the input. (This discussion will be limited 
to the recording process in most cases. Extension of 
these considerations to the playback process is usually 
obvious.) Matching the average rates while maintaining 
a uniform character spacing on the digital tape requires 
controlling the average tape speed. This control could 
be accomplished either by servo control of the speed 
or by using a fast start-stop tape mechanism and vary- 
ing the idle period. The latter procedure was chosen be- 
cause it results in a much simpler mechanism. 

A block diagram of the buffered system during record- 
ing is shown in Fig. 2. The system is constructed to 
allow any sampling rate from zero to the maximum 
allowable, to produce constant-length records, and to 
automatically control the start-stop cycle of the tape 
recorder so as to prevent the buffer overflowing. The 
analog signal is sampled, digitized, and the digits trans- 
mitted into the buffer each time an external synchroniz- 
ing signal is applied. A preset reversible counter keeps 
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Fig. 2 — Buffered recording system. 

track of the contents of the buffer, and when sufficient 
samples almost to fill it have been accumulated, the 
counter emits a signal which starts the tape recorder. 
For each character to be recorded, the recorder produces 
an unload pulse which causes the buffer to deliver one 
character’s worth of digits. The unload pulse also 
counts down on the reversible counter and up on a 
record-length counter. The latter, after a preset num- 
ber of characters have been recorded, stops the tape re- 
corder, which then waits until the next “almost-full” 
signal is received to start* recording the next record. 
Buffer loading continues during the recording cycle. 
This procedure requires the buffer to interleave loading 
and unloading operations in any sequence, and possibly 
to load and unload simultaneously. A modified com- 
mercial unit described in the next section meets these 
requirements. 

During its ON cycle, the recorder operates at a con- 
stant speed, thus simplifying the recording electronics 
over that required for a variable speed machine, and 
increasing the reliability. If the number of characters in 
a record is set to be less than the size of the buffer, and 
input sampling rate is less than the rate at which the 
buffer can be emptied, then the input sampling rate is 
entirely independent of the recording rate. Indeed, 
asynchronous sampling can be used equally well. 

The maximum sampling rate depends on the record- 
ing rate, the record length, and the minimum idle time 
of the tape transport. The relation for the maximum is 

recording rate 

maximum input rate = — : 

minimum idle time 

1 -| . . - . 

record writing time 

where the minimum idle time is the minimum time to 
produce one interrecord gap and the record writing 
time is the time to write the characters in one record. 
For the equipment described below, the recording rate 
is 30,000 characters per second and the minimum idle 
time 4.7 milliseconds. Thus for a record length of 1000 
characters, the maximum input rate is very close to 
26,000 characters per second (26,000 mode I samples 
per second or 13,000 mode II samples per second). 

During playback the digital-tape machine supplies 
characters to the buffer and the digital samples from the 
buffer are converted to analog output samples at the 


command of an external synchronizing signal. The op- 
eration sequence is such that one record of characters 
is put into the buffer from the tape initially. Thereafter, 
each time the buffer becomes almost empty, as indicated 
by a preset reversible counter, another record is put into 
the buffer. The tape transport is started by the “almost- 
empty” signal, and stops at the end of each record of 
data. All starting and stopping is thus done in the record 
gaps, and hence no data are read while the tape is ac- 
celerating or decelerating. This control prevents the 
buffer from becoming empty and divorces the rate of 
delivering output samples from the rate at which data 
are coming from the tape. 

The buffered recording system thus overcomes the 
principal limitations of jitter, inflexible sampling rate, 
and record length which harassed the operation of the 
original data translator. In addition, by using a higher- 
speed transport, a substantially faster recording rate is 
achieved. The details of the new translator are de- 
scribed in the next section. 

Data Translator Description 

A photograph of the new high-speed data translator 
to perform the task of recording speech and television 
signals on digital tape is shown in Fig. 3. The unit is 
contained in three racks. The rack at the left houses the 
tape transport and read-write electronics. The center 
rack contains the control circuits and the buffer storage 
unit, while the rack on the right contains the analog-to- 
digital and digital-to-analog converters. 

Many of the components of the new high-speed sys- 
tem are commercially available stock items. The tape 
transport is an Ampex FR-300 Instrumentation Tape 
Transport with a tape speed of 150 inches/second. It 
was found necessary to construct special reading trans- 
lators to attain the desired reliability. The recorder 
produces all features of the IBM tape format including 
lateral parity bit, longitudinal parity character, and 
end-of-file marks and gaps. During playback a clock 
generator which forms a logical “or” of all the seven 
tracks to produce a character synchronizing signal is 
required to provide timing. The IBM format of using 
six data bits plus an odd parity sum makes this unit 
necessary since it is possible to have a character with 
only one “one” in it and that “one” can occur in any 
of the seven tracks. 

The buffer storage unit is a Telemeter Magnetics 
1092-BU-7R buffer which is capable of storing 1092 
seven-bit characters. The buffer is capable of either 
loading or unloading one seven-bit character in 10 jx sec 
but it must not be loaded and unloaded simultaneously. 
In this application, input data must be loaded during 
writing on the tape, so that buffer is augmented by the 
addition of a seven-bit storage register and a guard cir- 
cuit. In the composite unit, loading of the buffer is un- 
changed except it is delayed for 10 /xsec by the guard 
circuit. This circuit establishes a zone which starts 10 
ix sec before and ends 10 fxsec after the actual loading of 








172 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 



Fig. 3 — The new high-speed data translator. 


the data. If an unload is attempted during the time of 
this zone, the actual unloading is delayed until the end 
of the zone. In this manner, the actual loading and un- 
loading of the magnetic cores can never occur within 
10 /xsec of each other. As a result, unloading is no longer 
uniform in time and may be delayed up to 20 /x sec de- 
pending upon the loading conditions. An additional 
storage register removes the 20 /xsec uncertainty in un- 
loading by holding the output data from the magnetic 
core unit until the subsequent unload operation is at- 
tempted. The augmented buffer unit, comprising the 
core storage, the guard circuit, and the seven-bit storage 
register can now be loaded and unloaded simultane- 
ously. However, the changes have reduced the maxi- 
mum rate of cycling of the buffer from 50 kc to 33 kc 
and have introduced a delay of one unload-sample-time 
in the output. 

The analog-to-digital and digital-to-analog conver- 
sions are performed by two Datrac B-611 converters 
manufactured by Epsco, Inc. These units are equipped 
with a sampie-and-hold feature and can sample and 
convert data at any rate up to the tape writing rate. 
Eleven binary digits are available to form two tape 
characters which specify the input signal to 0.05 per 
cent. The FR-300 allows simultaneous recording and 
playback. During recording, the tape is monitored by 
the playback heads and the resultant pulses are con- 
verted to analog samples by the digital-to-analog con- 
verter. Because the buffer is occupied during recording, 
the output samples are not buffered and appear only 
when the tape is in motion, but such a signal is adequate 
for monitoring with an oscilloscope. 


The control circuits are composed of about 130 solid- 
state logic units which perform the functions of data 
sorting, data counting, and control to integrate the tape 
transport, the buffer, and the converters. This unit, 
along with the packaging of the other components, was 
designed and constructed to our specifications by the 
D. G. C. Hare Co. of New Canaan, Conn. A great deal 
of care was taken to prevent uncertainties from creeping 
into the time of input sampling, the output sampling 
rate and duration of the output samples. During play- 
back, the sample duration is controlled by the time be- 
tween positive and negative zero crossings of the ex- 
ternal synchronizing signal. The operator supplies this 
signal so presumably the duration can be made as pre- 
cise as desired. Baring any digital errors, the only signal 
distortion is the quantizing noise, which is less than 
0.05 per cent and timing uncertainties which are func- 
tions only of externally supplied synchronizing signals. 

There are several functions which are manually exe- 
cuted by means of front panel controls. The digital tape 
can be punctuated by an end-of-file mark by means of 
“Write End of File” control. Any residue of data re- 
maining in the buffer at th£ end of a recording can be 
dumped on the tape by the “End of Data” button. Dur- 
ing playback the unit will read out data until an end of 
file is sensed, at which time it will stop and disconnect 
the external synchronizing signal as if the stop button 
had been depressed. A reset control clears the buffer and 
all of the logic. Front panel indication of buffer full, 
buffer empty, and parity error are provided. 

Effectiveness of New Translator 

The principal advantage of the new translator is its 
high speed and accuracy. This unit is capable of record- 
ing data in computer format on tape moving at 150 
inches/second. 

The unit has a channel capacity of 150,000 binary 
pulses per second. These pulses are used to represent 
analog signals to either 11- or 6-bit accuracy at sampling 
rates up to 12.5 and 25 kc, respectively. 

The rate of sampling input data is independent of 
digital tape speed and data density on the digital tape. 
The rate of output samples from digital tapes is also 
independent of these factors. It is possible, for instance, 
to record a speech wave sampled 10,000 times a second 
and play it back at two samples per second into a pen 
oscillograph. 

There are no discontinuities in gathering data, yet the 
translator produces a digital tape with convenient punc- 
tuation gaps at frequent intervals. This punctuation is 
removed automatically when reproducing the signal. 
There is no wow or flutter in the output due to fluctua- 
tions in the tape motion. 



1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


173 


Some Experiments in Machine Learning 

HOWARD CAMPAIGNEf 


"1 ^ VER since the development of automatic se- 

l quence computers it has been possible for the 
^ machine to modify its own instructions, and this 
ability is the greatest single faculty in the complex that 
tempts the term “giant brain.” Friedberg 1 demonstrated 
a new technique in the modification of instructions; he 
allowed the machine to make alterations at “random,” 
and lent direction to the maneuver by monitoring the 
result. This technique is far from being a feasible way to 
program a computer, for it took several hundred thou- 
sand errors before the first successful trial, and this was 
for one of the simplest tasks he could imagine. A simple 
principle of probabilities shows that a task compounded 
of two tasks of this same complexity would take several 
hundred thousand times as long, perhaps a million com- 
puter hours. It is the object of this study to examine 
techniques for abbreviating this process. 

The work reported here is not complete, nor is it 
likely to be for several years. The field is immense, the 
search proceeds slowly, and there are few clues as to 
where to look. This paper is, therefore, in the nature of 
a preliminary report. 

The memory of a computer can be pictured as a piece 
of scratch paper, ruled into numbered cells, on which 
notes can be written or overwritten. I set aside a portion 
of this memory in which the computer is supposed, 
somehow, to write a program. Some of this dedicated 
space is for dataand some for instructions. I programmed 
a generator of random numbers, and instructed the com- 
puter to use these numbers to write a program (and 
later to modify a program already written). The 
method of using the random numbers is such that all 
addresses generated refer to the data, and all instruc- 
tions generated are from an approved list. After a little 
thought one sees that by putting sufficient limitations 
on what the computer is allowed to write the result will 
be a foregone conclusion. The object is to design an ex- 
periment which leaves the computer relatively free from 
limitations but which will still lead to a meaningful re- 
sult. Friedberg did this. 

One way for the machine to write the instructions is 
to have it write random bits into a form word, the form 
and the dedicated spaces being selected to be coherent. 

Now a simple task is envisioned and a monitor rou- 
tine written to test whether the randomly generated pro- 
gram (which Friedberg christened HERMAN) has ac- 
complished the task. If it has it is tried again until the 
probability is high that the task is being done correctly. 

f American University, Washington, D. C. 

1 R. M. Friedberg, “A learning machine,” pt. I, IBM J. Res. and 
Dev ., vol. 2, p. 2; January, 1958. 


If at any step the task is not done then a change is made 
before another trial. It is clear that once a routine which 
can do the task has been arrived at no further changes 
will be made. The procedure can be pictured as a ran- 
dom walk. The number of possible programs is finite; in 
one of my experiments it was 2 96 . The rule for going from 
one trial to another can be chosen in various ways, but 
in the same experiment there were just 64 alternatives 
at each step. The number of routines which satisfy the 
test must be, judging from my results, on the order of 
2 96 /2 12 = 2 84 . For some tasks and for some repertories of 
instructions it can be estimated directly. 


An Example of a Machine Written Program 


Operation 


Operand 

Address 

Data 

QJP 

27 

00 

0447 

0440 

13351604 

COQ 

24 

00 

0443 

0441 

77777777 

LDQ 

22 

00 

0453 

0442 

0 

STQ 

23 

00 

0457 

0443 

57312317 

COQ 

24 

00 

0452 

0444 

0 

QJP 

27 

00 

0452 

0445 

77777777 

LDQ 

22 

00 

0445 

0446 

77777777 

QJP 

27 

00 

0457 

0447 

77777777 

STQ 

23 

00 

0444 

0450 

0 

LDQ 

22 

00 

0444 

0451 

0 

STQ 

23 

00 

0450 

0452 

0 

QJP 

27 

00 

0454 

0453 

77777777 

COQ 

24 

00 

0445 

0454 

54040476 

STQ 

23 

00 

0442 

0455 

77777777 

COQ 

24 

00 

0447 

0456 

77777777 

LDQ 

22 

00 

0451 

0457 

0 

In designing 

these experiments one 

has a tremendous 


number of choices. There is the repertory of instructions 
from which the machine chooses to make up its routine. 
The instructions of this repertory need not be selected 
with equal probability; some can be used more often 
than others. There is the size of the area of the memory 
dedicated to the random routine. There is the task to be 
performed. There is the time allowed the routine to 
make its trial. And there are a multitude of other varia- 
tions, some of which will be mentioned later. 

I first used the simplest repertory to be found capable 
of performing the Sheffer “stroke” function. It is: 
LOAD from y into the accumulator, 

STORE at y from the accumulator, 
COMPLEMENT the accumulator, 

JUMP to y if the accumulator is positive. 

This differs drastically from that used by Friedberg. 

The area of the memory devoted to the random rou- 
tine must be in two parts, one for data and the other 
for instructions; otherwise the machine would try to 
execute data and come to an intolerable halt. Therefore 
the address y of the JUMP order must be interpreted 
differently from those of the LOAD and STORE orders. 

A problem related to that which leads to the separa- 
tion of the two dedicated areas is that endless loops are 
highly probable and intolerable. This problem can be 
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solved by the routine which interprets the jump ad- 
dresses. At the occasion of a JUMP a calculation is 
made about the time of running; if the time is excessive 
then the trial is terminated and called a failure. If the 
time is acceptable then the address to which the jump 
is made is interpreted. 

In an earlier experiment I used a clock to time the 
routines and adjusted the jump addresses with a B-box. 
The use of a clock made checking and debugging very 
difficult and eventually forced me to the later method. 

I have used several sizes of dedicated spaces and plan 
to try still more, but most of the trials have been with 
sixteen lines of coding. Friedberg used sixty-four lines. 
Clearly one must allow enough lines to permit coding 
the task. More than enough slows the “learning” proc- 
ess. To allow just enough prejudices the event and is 
not “fair.” A felicitous solution to this quandary would 
be to find a way to expedite the “learning” so that very 
large areas of memory can be dedicated, thus guarantee- 
ing that the answer has been supplied by the machine. 
It is the object of this study to find such a solution. 

The time allowed to execute the routine can be varied. 
Its absolute minimum is that for three instructions. The 
maximum could be quite large if the routine were re- 
traced several times. Oddly enough the time allowed on 
each trial has only a small effect on the number of trials 
before learning. This may be because when the machine 
has extra time it uses that tifne to destroy what it may 
already have accomplished. If so then two effects tend 
to neutralize each other; the freedom of more time tends 
to lengthen the learning period, and the greater number 
of potential solutions tends to shorten it (Table I). 


TABLE I 

The Effect of Time on Learning 


Time 

Median number of 
trials 

Number of 
experiments 

14 

4400 

31 

10 

5800 

47 

7 

6800 

30 

4 

15,500 

13 


These experiments differed only in the time allowed 
each trial. The time is measured by the number of pro- 
gram steps possible. Each experiment was run until the 
machine had demonstrated that it succeeded (one 
hundred consecutive successes) and then the number of 
trials was recorded. In this set of experiments about 38 
per cent of the trials appeared by accident to be suc- 
cesses. Thus the number of different programs tried was 
about 62 per cent of the total number of trials. As the 
time was shortened it became more difficult to find a 
successful routine. 

The median has been quoted here to avoid a bias 
which might affect the average. The range of the num- 
ber of trials is large, and there might be some prejudice 
against the longest runs, such as stopping them to 
check for faults. 


The task selected was to transfer a word from one 
place to another. This was chosen because of its sim- 
plicity. An easier task was to copy a word at a pre- 
scribed spot from one of several places (redundant in- 
put). A harder task would be to copy a word at several 
places (redundant output). With redundant input (the 
key word written at two accessible places) the median 
number of trials was 2600, compared with 5800 in non- 
redundant experiments. 

The procedure for the machine making changes in the 
routine offers many opportunities to be different. In one 
series of trials the routine was rewritten completely 
after each failure. In another only one instruction was 
rewritten, the instructions being taken in turn in succes- 
sive tests. In another series just one instruction was re- 
written, this time chosen by an elaborate procedure in- 
volving “success numbers.” The success numbers were 
accounts, one for each instruction of the routine, which 
were increased when a success or what appeared to be a 
success happened, and decreased when a failure oc- 
curred. This procedure was roughly the same as that of 
Friedberg's, although not quite as elaborate. 

A comparison of the results of these alternative meth- 
ods were that “learning” occurred most rapidly with the 
first method, about twenty-five hundred errors before a 
success. It was less rapid with the second, about six 
thousand trials, and slowest with the last, over one 
hundred thousand trials. The “learning” is an abrupt 
process, a “flash of insight.” The procedure with success 
numbers resembles that used in other experiments with 
self-improving programs, such as those used by Oet- 
tinger. 2 It seems to be inappropriate here since a line ol 
coding is not itself a unit. If the program were organized 
into units to which success numbers could be appropri- 
ately applied then it would begin to appear that the 
answer to our problem was being built into our ap- 
proach. Success numbers will have an important place 
in the ultimate “learning” machines, but some sort of 
self-improving without them will also be required. 

A Sample of Experiments Arranged in Order 

No. of Trials Apparently Right No. of Trials Apparently Right 


20 

20 

576 

273 

38 

32 

592 

280 

54 

37 

598 

303 

68 

39 

612 

317 

72 

46 

614 

297 

76 

40 

618 

290 

76 

47 

680 

331 

88 

53 

800 

385 

156 

77 

996 

509 

160 

89 

1090 

526 

162 

88 

1100 

526 

234 

131 

1246 

595 

222 

124 

1324 

629 

276 

130 

1412 

688 

294 

153 

1420 

678 

330 

160 

1528 

753 

338 

173 

1616 

782 

352 

160 

2016 

990 

386 

181 

2872 

1366 

521 

249 

3198 

1522 


2 A. G. Oettinger, “Programming a digital computer to learn,” 
Phil. Mag., vol. 43, pp. 1243-1263; December, 1952. 
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TABLE II 

Synopsis of Experiments 


Type of change 

Length 

Time 

Task 

Competing twins? 

Median no. 
of trials 
before 
learning 

Notes 

All 

One at 
a time 

Transfer 

word 

Choose 
an exit 

Yes 

No 


X 

2 

2 


X 


X 

548 


X 


14 

10 


X 


X 

960 

Reversed exit 


X 

14 

10 


X 


X 

1530 

Input redundant 

X 


16 

12 

X 



X 

1567 



X 

16 

12 

X 


X 


1775 



X 

8 

4 

X 


X 


1902 


X 


16 

10 


X 


X 

2097 



X 

16 

10 


X 

X 


2210 



X 

8 

4 

X 


X 


2280 


X 


16 

10 


X 


X 

2608 

Reversed exit 


X 

16 

10 


X 


X 

2683 


X 


16 

10 


X 


X 

2998 

Reversed exit 


X 

16 

10 


X 


X 

3125 

Input redundant 


X 

16 

10 


X 


X 

3610 

Input redundant 

X 


16 

10 


X 


X 

4200 



X 

16 

12 

X 



X 

4300 



X 

16 

10 


X 


X 

4401 


X 


16 

10 


X 


X 

4406 



X 

8 

4 

X 



X 

4988 



X 

16 

8 

X 

* 

X 


5177 



X 

8 

4 

X 



X 

5280 



X 

16 

10 


X 

X 


6700 



X 

16 

7 


X 


X 

6800 



X 

16 

10 


X 

X 


7273 



X 

16 

10 


X 


X 

8600 



X 

16 

8 


X 

X 


9266 



In these experiments the minimum possible space, 
two orders, was allowed for the program. The learning 
was judged complete when 20 successive right answers 
were given. Thus there is a chance of something less 
than one in a million that a given experiment did not 
succeed, despite appearances. Notice the wide range, 
the slowest taking 160 times as long as the fastest 
(Table II). 

These experiments throw some light on this kind of 
“learning,” and particularly on which versions learn 
fastest. The process is analogous to evolution, since a 
routine survives only if it meets the challenge of its en- 
vironment. This analogy suggests some further experi- 
ments where each trial is built on two successful trials, 
perhaps by taking lines of coding from each, much as 
genes are taken from chromosomes. I have hopes that 
routines can be built in this manner to meet complex en- 
vironments and that the number of trials will be only 
the sum of the number for each individual requirement 
rather than the product. 

In the experiments described above once a task has 
been “learned” there is no further improvement. It 
would be desirable for the routine to improve its per- 


formance even after it had demonstrated acceptable 
skill. This can only be done by some sort of flexible cri- 
terion of satisfaction, and by some way of keeping prog- 
ress already made. I have tried to do this by twin learn- 
ing routines. The twins compete to see which will first 
learn the task. When one of them has learned the other 
continues to try, but now it must complete the task 
more quickly than its sister. In this way some improve- 
ment takes place. It is not quicker than the alternative 
of insisting on a high standard ab initio. 

Other techniques need to be tested. One of these is to 
use as components not lines of coding but subroutines. 
In this way the average coherence of the trials should 
be raised. Another is some way of accumulating suc- 
cesses and then using them cooperatively to meet more 
and more complex environments. This resembles bio- 
logical history, where evolution has produced increas- 
ingly complex organisms which become more and more 
effective in dealing with their environment. 

If someone could invent a technique which produced 
programs as effective as organisms in a time which is 
electronic rather than biological it would be a revolu- 
tion in programming. 
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Some Communication Aspects of 
Character-Sensing Systems 

CLYDE C. HEASLY, Js.f 


Introduction 

A LARGE number of bad jokes have been made 
about the “real characters” encountered in the 
reading machine business. If the jokes are not 
too funny, they serve nonetheless to highlight a signifi- 
cant fact. Character-sensing machines must read real 
characters if the machines are to be useful outside of 
the laboratory. 

Practical character-sensing machines have been in 
use for a number of years. Enough experience has now 
been accumulated to warrant an analysis of the envi- 
ronment in which they must operate. Specifically, this 
will involve an understanding of the nature of real char- 
acters, the factors which determine their nature, the 
techniques which may be employed to improve these 
factors, and some of the considerations which must be 
judged in determining which techniques are appropri- 
ate to a given system. 

A Character-Sensing System Model 

Fig. 1 shows a character-sensing machine enmeshed 
in the conventional model of a communication system. 
Messages are sent along this communication system by 
selecting a series of symbol shapes and transferring the 
shapes in inked form to a document. The document is 
then moved by various handling processes to the point 
where the message is to be received. There the docu- 
ment is scanned, the shapes converted to signals and 
the signals decoded. 

As the model is considered in greater detail, the en- 
coder is found to degrade the symbol shapes by varia- 
tion in strength of impression and in amount of inking, 
and by inking noise. On typical business equipment, 
these factors may result in line-width variations of five 
to one, missing portions in light impressions, blotted 
portions of heavy characters, random additional inter- 
ference on either light or dark characters, and a pro- 
nounced ribbon pattern. 

The handling process further degrades the symbol by 
superimposing interference. In the worst cases this may 
be offset endorsement stamp ink, or in less drastic 
cases, a mild smudging of the printed characters. 
Finally, the sensing mechanism and quantizing appara- 
tus cooperate to produce a signal representative of the 
character shape. Whether the decoding of this signal 
will yield the original intended characters depends on 
how carefully information has been conserved all along 
the channel. 

f Intelligent Machines Res. Corp., Alexandria, Va. 



Fig. 1 — The character-sensing machine is the decoder 
of a communication system. 


The Encoder — Practical Data Printers 

Selection of a typeface which complements the read- 
ing scheme or combats typical interference is one of the 
least expensive changes. Apart from tooling costs, type 
bars or wheels are inexpensive. The symbols must be 
similar for the conventional characters which they 
represent and should conform to the limitations im- 
posed by the printing techniques to be used. Within 
these constraints, however, there is ample room for the 
designer to increase the amount of information which 
represents each character. 1 

Regardless of the type shapes used, it is intended 
that the shape be completely black and that everywhere 
else the document be white. It is the job of the printing 
process to approach this intended ideal as closely as pos- 
sible. Direct printing from inked type and lithography 
border on the ideal. Unfortunately, such procetses can 
only be used for fixed or serial data which are applied 
before the document is in field use. 

Variable data are frequently printed by key-driven 
business equipment, typewriters, adding machines, 
cash registers, and the like. Generally these equipments 
print with ribbons and metal type. When the recorder 
is built from scratch, it is desirable to use controlled 
impression and direct-transfer ribbons or the newer ink- 
saturated rubber type. Another class of data recorder 
is the identification imprinter which prints from em- 
bossed plates by means of ribbons, ink-saturated roll- 
ers, or carbon papers. 

1 C. C. Heasly, Jr., “Self check — a new common language,” pre- 
sented at ACM Symp., Los Angeles, Calif.; May 8, 1958. Copies 
available at Intelligent Machines Res. Corp., Alexandria, Va. 
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However, the data recorder is the most numerous ele- 
ment in many character-sensing communications. The 
ability to use existing business equipment with only 
minor modifications often provides the economic im- 
petus for a character-sensing system. Thus the system 
designer may find himself limited to changing typeface 
and perhaps use of an improved ribbon. 

A third source of variable data is tabulators and vari- 
ous high-speed printers which record the output of data 
processing systems. Tfiese printers for the most part 
use ribbons and, because of their high speed, a direct 
transfer ribbon becomes a very costly nuisance. 

The preponderance of ribbon printers and low-cost 
recording devices virtually forces system designers to 
examine their output and to find ways to recover the 
information which they record. The principal character- 
istic of ribbon printing is the dot structure imposed by 
the ribbon itself. The size of the dots varies with ribbon 
inking, strength of impression, and chance alignment 
between type and ribbon. Under light impression and 
light inking, some dots may fail to print. Under normal 
inking, the dots blend together to form lines of appreci- 
able width with irregular boundaries. Heavy inking and 
pressure may result in considerable bleeding of ink well 
beyond the intended boundaries of the character shape. 
In nearly all cases, unintended spots occur in the nom- 
inally white areas of the character, their number and 
darkness being dependent on inking and impression. 

These dots of varying size and intensity are not indi- 
vidually reliable. However, they may be aggregated by 
statistical operations to yield reliable bits which define 
the strokes of the characters. The printing resolution of 
the dots, usually 8 to 10 dots from top to bottom of a 
character, may be statistically “quantized” to a reliable 
vertical resolution of five bits. This is equivalent to 
stating that a five-bit code might be printed reliably in 
the vertical space occupied by a character symbol. As 
will be shown, this resolution is about the minimum 
which can be used to print symbols which resemble con- 
ventional character shapes. Fig. 2 shows enlarged views 
of typical characters under three different impressions. 

The Communication Channel— Handling Noise 
and Discriminating Ink 



Fig. 2 — Character line widths and noise vary 
greatly with inking and impression. 


is possible to use inks which have more than one sensible 
property. The most widely known is of magnetizable ink 
which has optical properties for human recognition and 
magnetic properties for machine sensing. A second ap- 
proach which has likewise been used in the banking 
problem is the use of glossy inks. Here varnish agents 
cause a glossy character surface which can be sensed in 
combination with direct black to discriminate against 
all but the extremes of handling noise. An even more 
powerful approach can be achieved by use of a combi- 
nation fluorescent ink which appears black under nor- 
mal illumination, but which fluoresces under special il- 
lumination. Here again, combination sensing tech- 
niques may be used to overcome interference due to 
handling. 

It is worth noting that selection of a printing medium 
is in no way helpful in combating the noise resulting 
from the printing process itself. For example, the car- 
bons used in credit identification invoice forms are the 
chief source of noise in that system. Efforts to reduce 
this noise by improved carbon have been very reward- 
ing, but no advantage would be obtained by using mag- 
netic or fluorescent material, since the noise would 
have the same property as the characters. 

Sensing Methods 


Handling noise, surprisingly, can often be controlled 
by fairly simple methods. For example, on bankchecks 
the largest sources of noise are the handwritten figures, 
signature, etc., and wet endorsing ink offset from the 
back of one check to the front of the next. Intelligent 
format design can isolate handwritten data from the 
reading field. Endorsing stamp offset can be controlled 
by restricting the variety of ink colors to those which 
can be discriminated by color analysis scanning. 

Regardless of the printing method, the choice of 
printing medium offers interesting possibilities. Con- 
ventionally, the sole requirement of inks is good optical 
contrast. However, where the primary source of noise 
is in the handling rather than in the printing process, it 


Perhaps the most important system choice and cer- 
tainly the most interesting from an analytical point of 
view is the choice of a sensing method. The two choices 
worth serious consideration are amplitude scanning and 
two-dimensional scanning. 

For quite a while these scanning schemes were close- 
ly associated with the medium in which the characters 
were sensed. Magnetic character sensing has been as- 
sociated almost exclusively with amplitude scanning, 
while optical character sensing has been based on two- 
dimensional scanning. However, two-dimensional scan- 
ners have recently been developed for magnetic char- 
acters, and an amplitude scan system has been an- 
nounced for optical character sensing. 
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The principal difference between the two schemes is 
the way in which they respond to the two-dimensipnal 
information in a character. Two-dimensional scanning 
covers each point of the area, usually in a scanning 
raster similar to television. Amplitude scanning com- 
presses information in one direction into an amplitude 
signal. For example, in magnetic sensing, amplitude 
scanning is accomplished by passing the characters 
under a vertical gap. The scan signal responds to the 
total amount of magnetic ink passing under the gap. 
As reading progresses, the read head produces an out- 
put function related to the horizontal distribution of 
magnetic material along the character. Although it re- 
sponds to the vertical amount, it cannot respond to the 
vertical distribution of the contributing elements. This 
results in a loss of information. 

Superficially considered, the amount of information 
lost by amplitude scanning may seem trivial. However, 
even casual analysis will reveal that the signal re- 
covered from scanning a conventional “2” is virtually 
identical with that of a conventional “5.” Similarly, the 
difference between conventional “4” and “ 9 ” or “0” and 
“8” are alarmingly small. In practice these pairs may 
be rendered different by changing the horizontal scale 
of one or by changing widths of vertical strokes. This is 
a perfectly valid solution to the problem, but these ex- 
amples serve to underscore that shape differences 
easily detected in a two-dimensional scan are obscured 
by amplitude scanning. It is of some interest to gain a 
more exact knowledge of information contained in char- 
acters and of the ways in which it is preserved or lost. 

Analysis of Scan Information 

The maximum amount of information which can be 
used to convey a set of character symbols may be said 
to be the number of bits which can be resolved by scan- 
ning and quantizing the “printing cell” occupied by one 
character. It will be shown presently that the value of 
this resolution will depend on the scanning method, 
but for the moment let resolution be considered inde- 
pendently of scanning method. This may be thought of 
as the channel capacity in bits per symbol. While chan- 
nel capacity could be used as the independent variable, 
it is both more convenient and more meaningful to re- 
late results to an independent vertical resolution which 
may be defined as the number of bits which can be re- 
solved along a vertical line extending from the top to 
the bottom of the cell. 

The manner in which amplitude scanning reduces in- 
formation can be easily understood from an example 
based on a vertical resolution of “4,” although better 
resolutions are required if the constraint that the sym- 
bols resemble conventional characters is to be obeved. 

With a vertical resolution of 4, 16 different inputs, 
Xu may occur. The amplitude scanner can respond to 
such inputs by providing 5 different outputs, y 3 -. Fig. 3 
is a matrix which shows the joint probabilities P t j re- 
lating the 16 inputs and the 5 outputs. 
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JOINT PROBABILITY, P,j MATRIX 
AMPLITUDE SCAN, VERTICAL RESOLUTION 4 


Fig. 3 — Probability matrix shows how amplitude 
scanning response loses information. 


The information content H{X) of the input (which 
for the present is assumed to be equiprobable) can be 
seen to be 4 bits. 

H(X) = - £ Pi log. Pi 

= — 16(1/16 log 2 1/16) = 4. 

The probabilities, Qj, of the five scanner outputs are 
found by summing the elements of each row of the 
matrix. From these in turn the output information 
Y) is found. 

s(y) = - E a logs a- 

= - (2/16 log 1/16 + 8/16 log 4/16 + 6/16 log 6/16) 
= 2.03 bits. 

Thus, for every vertical column of 4 bits in the input 
channel, an amplitude scanner captures 2.03 bits and 
loses 1.97 bits. Regardless of the number columns in the 
cell, nearly half of the input information is destroyed. 

The graph of Fig. 4 shows how the efficiency of this 
type of scan decreases as the resolution increases. The 
actual channel capacity (input information), condi- 
tional information (lost information) and transinforma- 
tion (output information) are shown for a character cell 
having aspect ratio 5 to 4 (vertical to horizontal). 

Character Information 

Whether or not this information lost is meaningful, 
of course, depends on the extent to which the input sym- 
bols utilize the channel capacity. Fig. 5 shows a set of 
symbols based on a vertical resolution of 5 with 5 to 4 
aspect ratio. It can be seen that this is about the poor- 
est resolution which can be used and still obtain ten 
symbols which resemble the numeric digits. It can also 
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Fig. 4 — Efficiency (utilization index) of amplitude scanning 
decreases as resolution increases. 




Fig. 5 — Exemplary numeric font based on vertical resolution 
of 5 uses 15.4 bits (out of possible 20). 


be seen that variations in style are quite restricted. This 
font has been deliberately designed to circumvent the 
limitations of amplitude scanning by changing the hori- 
zontal scale of the “2” to differentiate it from the “5.” 

The channel capacity is 20 bits. The type font utilizes 
15.4 bits, of which 8.18 bits are recovered by the ampli- 
tude scan. 

Vertical Resolution and Quantizing 

It may not be readily apparent that the vertical reso- 
lution as defined does not have the same value for a 
two-dimensional scan as for an amplitude scan. This 
difference arises from the methods whereby quantizing 
may be used to effect resolution. 

A two-dimensional scan system offers the possibility 
of quantizing in two steps. The first step involves ex- 
amining each elemental area of the image to determine 
whether it is intended to be black or white. The second 


step involves performing a statistic on the results of 
the first step to determine whether a particular small 
area is intended to be black or white. It is in this sense 
that the resolution has been defined, the small areas 
which can be resolved in a printing cell being the bits 
which comprise channel capacity. 

An amplitude scan must perform the equivalent 
quantizing in a single step and must quantize to a larger 
number of levels, in the 5 to 4 case just shown, an am- 
plitude scan must have the ability to quantize to six 
output levels in order to resolve the five vertical bits. 

The significance of this relationship between quan- 
tizing and resolution is more apparent in the case of 
nonuniform impression. Fig. 6 shows the two examples 
of the character 5, one printed light and one dark. It is 
at once obvious that a two-dimensional scanning system 
could easily resolve either input character with a verti- 
cal resolution of 5 bits. The transmitted amplitude-scan 
information is shown below in the two examples. If the 
two amplitude-scan signals are normalized so that maxi- 
mum responses are equal, quantizing to six levels will 
produce the two different results shown. In order for 
printing to have resolution of 5 bits for amplitude scan- 
ning, vertical line widths of horizontal strokes have to 
be constant within plus or minus one-sixth of the nomi- 
nal line width. It can be shown that this is equivalent 
to a vertical resolution of twenty-five bits for a two- 
dimensional scan. 

Thus it is seen that, if amplitude scanning is used, 
the freedom from handling interference which magnetiz- 
able inks and magnetic sensing offer, is purchased at 
the cost of considerable loss of recovered information. 
This may be compensated by greatly improved resolu- 
tion. Either the printing cell must be greatly increased 
in area or the printing must be held to very close toler- 
ances. Since character area is at a premium in many ap- 
plications or fixed in advance by the printing mecha- 
nism, close printing tolerance is apt to be the only answer. 

Quantizing and Decoding 

In most practical character sensing decoding equip- 
ment, it is difficult to draw a fine line between the quan- 
tizing mechanism and the decoding mechanism, since 
these system elements often work in close cooperation. 
It is probably true that the real sophistication in char- 
acter-sensing machines is in the quantizing mechanism. 
It is in improved quantizing methods that information 
may be preserved and engineering considerations of 
simplicity, reliability, and cost may be most profitably 
pursued. 

Once the information in a character has been sensed 
and quantized, the decoding of the resulting signal is 
fairly straightforward. Choice of logic schemes is not 
as complicated as might be supposed. The success of one 
decoding scheme as compared to another will depend 
primarily on the efficiency with which it utilizes the re- 
dundancy of the quantized signal to promote accurate 
decoding. 
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FOR VERTICAL RESOLUTION OF 5 


Fig. 6 — Amplitude scanning is greatly affected 
by variations in impression. 


Thus the character is seen to be a code which is su- 
perior to other conventional codes only in the excess of 
information it embodies and in the ease with which de- 
coding can be performed by human beings who have 
learned to read. It might be remarked that a fair por- 
tion of the information present in real characters has to 
be used in combating the low fidelity and noise intro- 
duced by conventional printing methods. These con- 
siderations aside, there is little choice between bar or 
dot codes and conventional characters so long as area 
and printing resolution are identical. 

Once the nature of the character is identified, it be- 
comes quite apparent that any decoding scheme will 
give better performance if it has more information as 
its input. 

Summary 

The attempt here has been to consider the entire 
communication system of which the character-sensing 
equipment is but one functional part and to identify 
both the factors which reduce information transmitted 
by the system and the techniques which may be used to 
insure that the message is finally received ungarbled. 

If there is any single conclusion to be drawn, it is per- 
haps that the best way to achieve economical charac- 
ter sensing is to consider the whole communication sys- 
tem in which it is embedded and to improve the infor- 
mation transmitted by every element of the system in- 
sofar as economical and system requirements permit. 


Beyond this the points at which the most improvement 
per dollar could be obtained might be re-emphasized. 

Since the printers are frequently the most numerous 
elements in the system, the extent of change is usually 
limited. Changes of typeface are inexpensive and may be 
quite effective in improving the amount of information 
per character. Improvements which enhance printing 
resolution are the most rewarding. A simple increase in 
the character cell area is as effective as improving the 
printing tolerance, but frequently neither improvement 
can be made without printer redesign. Reduction in 
printer interference is particularly worthwhile, espe- 
cially if special inks are needed to combat handling 
interference. 

The best solution to handling interference is to con- 
trol the source of interference. If interference cannot be 
eliminated by document design and improved handling 
procedures, it may be possible to restrict the kinds of 
interference to those which can be discriminated by 
special sensing techniques. If this is not possible, then 
use of special inks which may be detected outside of the 
interference medium is useful. 

The scanning employed should recover as much of 
the character information as possible. From an infor- 
mation point of view, it may prove to be better to use 
scanning which conveys all of the character informa- 
tion even though degraded by interference, than to scan 
in a medium which is free of interference with a scan- 
ning technique which degrades the character informa- 
tion. This last conclusion is particularly true when 
close printing tolerance cannot be maintained, when al- 
phabetic or alphanumeric information must be trans- 
mitted, when the typeface cannot be controlled in ad- 
vance, or when serious interferences are inherent in the 
printing process. 

With respect to quantizing, considerable care is war- 
ranted to insure that the information is quantized 
rather than some other variable which is only casually 
related to the information. 

In this regard it is well to note all a priori information 
about the symbols and the printing process. Full utiliza- 
tion of this information in the quantizing is the only 
way to insure maximum information at the decoding 
stage. 

Finally, it might be said that system design of char- 
acter-sensing communication systems is primarily a 
matter of selecting that set of system elements consist- 
ent with the problem requirements which will convey 
the maximum amount of information. It is hoped that 
this survey of system elements and their characteristics 
has been helpful to that end. 
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An Approach to Computers That Perceive, 

Learn, and Reason* 

PETER H. GREENEf 


T HE purpose of this paper is to mention some of 
the problems which I believe will be central ones 
in the effort to design machines that can in some 
sense be said to perceive, think, learn, make reasonable 
inferences, or learn to perform acts of skill. First I wish 
to discuss a general conceptual outlook which has im- 
portant consequences for our approach to the design of 
such machines. I shall show how this outlook calls for 
modifying or supplementing some current approaches; 
however, specific techniques for dealing adequately 
with the problems raised remain to be found. The 
second part of the paper will point to specific tech- 
niques which now exist for dealing with the problem of 
reasonable inference and inductive judgment, but which 
to my knowledge have not been considered in connec- 
tion with computers and control mechanisms. 

Polya [34] gives certain heuristic principles whereby 
we tentatively arrive at judgments of the likelihood of 
propositions. Yet he categorically states that there can 
never be a machine which can perform such plausible 
inferences. Why is this? I suggest that one reason is 
merely that he did not anticipate progress which has 
occurred since his book in the analysis of such inferences. 
I shall discuss this progress later in this paper. 

Another reason for Polya’s pessimism is more funda- 
mental. Present-day machines cannot really be said to 
contain the meaning of the propositions they handle. 
Every proposition is reduced to a hole or an electric 
charge or a magnetic field at some point of space. Thus 
the propositions may be sorted and combined but their 
meaning resides in the mind of the person who looks at 
a list which says that a hole in this place means that 
proposition. Pattern recognition, too, depends in all but 
a few experimental computers upon preset responses 
which do not take meaning into account. Thus it has 
become fashionable to talk in terms which exclude 
meaning. We recognize the next letter or word, so it is 
said, because we know the transition probabilities. 
This is only a partial solution ; it ignores the outstanding 
fact that we are able to use relevance and meaning as 
guides, rather than probabilities alone. The first topic 
will concern a general conceptual outlook which is es- 
sential if computers are ever to become capable of 
having this too. 

* This research was supported by the U. S. Air Force through the 
Air Force Office of Sci. Res. of the Air Res. and Dev. Command 
under Contract No. AF 49(638)-414. Reproduction in whole or in 
part is permitted for any purpose of the U. S. Government. 

t Committee on Mathematical Biology, University of Chicago, 
Chicago, 111. 


Most of our perception is too rich to be described in 
what is called a discursive code, a one-dimensional sort 
of language that can be written on a typewriter or 
spoken or put on magnetic tape. Thus we shall need to 
make use of non-discursive symbols (or presentational 
symbols in Susanne Langer’s terminology) which are 
rich enough to embody complex patterns in their en- 
tirety as presented to the computer. Such recognition, 
pattern by pattern instead of element by element, has 
been discussed before — for instance, in connection with 
Uttley’s conditional probability machine [35]. How- 
ever, the basic mechanism for the recognition of pat- 
terns in all these studies is that of abstraction, which is 
supposed to account for the recognition of complex pat- 
terns and general ideas. This makes use of association, 
the formation of a linkage between representations of 
similar patterns. 

That abstraction as generally defined is totally in- 
adequate as a fundamental principle has been pointed 
out convincingly by many authors, who unanimously 
arrive at that judgment from diverse starting points. 
For their arguments I refer you to the tfibliography on 
judgment and concepts. A first inadequacy of abstrac- 
tion as the fundamental mechanism of concept forma- 
tion involves the person’s or computer’s way of ap- 
proaching the matter to be examined. If I hold up my 
hand with one finger extended, what makes you look in 
the direction of my finger instead of the direction of my 
shoulder, or indeed look in any direction whatever? The 
answer is that the meaningfulness of the gesture de- 
pends upon a background of conventions which deter- 
mine one’s way of approach and which must precede 
any alleged abstractions. Thus more is needed than 
abstraction [12 ] . 

The second fundamental difficulty is that abstraction 
is said to proceed by the formation of groups of things 
which are similar in regard to the abstracted concept. 
But the formation of these classes presupposes some 
notion of the concept just to determine which things are 
similar in regard to it. It presupposes at least a certain 
“point °f view” from which the elements can be desig- 
nated like or unlike. The development of the concept 
then involves a further articulation of this general point 
of view [3 ] . 

The third fundamental difficulty is that abstraction 
works by forgetfulness. Cassirer points out that this 
separates us more and more from the pattern and pro- 
duces something superficial. He contrasts this with the 
generality reached, say, when a mathematician general- 
izes. For instance, it is a generalization to say that cir- 
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cles and ovals are all instances of the same concept, 
namely ellipses of various eccentricities; but this gen- 
eralization, rather than neglecting the particular fea- 
tures, embodies a universal rule for the connection and 
ordering of the particulars. In general, the essential 
feature of the general concept is a generating relation 
which gives a principle of ordering and dependence. We 
have seen that this is presupposed by the ability to form 
classes of similar elements. It is also presupposed by the 
ability to make reasonable judgments about a whole 
group of forms without running through the particu- 
lar examples — often an infinite set — individually. To do 
this we need some representation of a generating rule 
of construction [3]. 

Thus the first task which precedes abstraction is not 
the comparison of representations but the formation of 
impressions into representations adequate to sustain 
abstraction. We tend to think that abstraction alone is 
sufficient because this first task has for the most part 
already been performed in infancy and in the construc- 
tion of our traditional language. We therefore pre- 
suppose that the raw materials for our judgments are 
not disconnected particularities, but are already given 
to us as a connected manifold. The essence of these argu- 
ments was first arrived at by Kant in 1781 [2] , [5]- [7], 
[9], [10]. 

The consequences for computer design are many. 
First, we must find a way of building up meaning, 
which, though it may construct hierarchies of patterns 
in a way called for by many authors, must use essen- 
tially richer means of construction than the means 
called for by those authors. That is to say, we need more 
than the principles of association and abstraction, al- 
though these will be important ingredients. We shall 
see later that abstraction itself, in its valid aspect, re- 
quires more than association. Second, the construction 
of the unitary rule of connection of a manifold — the 
“first universal” of the 19th century logician Lotze — 
requires a principle of construction which supplements 
the relation of class inclusion which is appropriate to 
the formation of the “second universal” — the class of 
similar objects. It is this latter relation alone which has 
been studied in any detail by people interested in com- 
puters. Third, since the truly general concept contains 
structure which includes the generative principle of the 
system in which it stands, it follows, as Kant and Cas- 
sirer indicate, that no content can be posited without 
thereby positing a complex of other contents. In other 
words, true generality embodies the judgments of causal 
relations which we are seeking to embody in computers 
[2], [5], [6], 

Fourth, we must not regard the linguistic concepts of 
the computer as copies of a definite world of facts; if so, 
we are doing the machine’s job of drawing the significant 
outlines of meaningful units. Rather, the computer, be- 
fore it compares experiences, must in Cassirer’s words 
concentrate the experiences and distill them down to 
one point. This point is the unit which is at present 


handled in the ingenious ways known to computer de- 
signers, but here we see that an involved process must 
go into its formation. Since the particular outlines con- 
structed depend upon generative principles, it would be 
expected that any conceptual product of such a com- 
puter would be constructed out of manifolds of related 
precursors formed around individual generative princi- 
ples built into the machine. This appears to be the 
case in human thinking [l], [13 ] , [l 7 ] , [19] — [2 1 ] . 

Fifth is the need to supplement those notions of the 
relations between a language and the world that have 
so far been worked out, for example, Carnap’s treat- 
ment of mathematical semantics. In treatments like this 
a sentence is true if and only if the fact to which it refers 
happens to be the case. Thus they assume an extra- 
linguistic world of facts open to observation. In our 
problem, on the contrary, observation is possible only 
through symbolic forms contained in our language; 
hence we may say that for us, or for the machine which 
we envisage, the world of observables comes into exist- 
ence along with the language. The same drawback is 
apparent in the analysis of meaning given by Morris, 
which is very frequently cited in connection with our 
problem. The first link between information theory and 
meaning, namely the notion of semantic information of 
Carnap and Bar-Hillel, suffers from the plight of logical 
atomism, the doctrine that complex ideas may be 
analyzed into one of the 2 N possible conjunctions of N 
atomic facts or their negations. Please bear in mind 
that I am not making a single complaint against any of 
the methods I have cited. These methods, together with 
all existing proposals for logical networks, must be sup- 
plemented by something which produces the meaningful 
symbolic forms which are the raw material for these 
methods and networks. At present it is the human 
operator, not the machine, which produces this raw 
material [ 1 1 ] , [29 ]— [3 1 ] . 

I shall conclude this conceptual part of the paper 
with a few remarks on what we would mean by subjec- 
tive action and thinking in a machine. The question is 
very involved and I am not competent to provide an 
adequate answer. I shall therefore name only a few 
properties which we can all agree on as necessary, and 
which will lead to the discussion of specific techniques 
which are now available. 

We may all agree that one feature that we seek is the 
presentational nature of the internal symbolism of the 
machine, involving the ability to perceive patterns too 
complex to be communicated verbally and the ability 
to acquire skills which we are not able to program dis- 
cursively for the machine. Since the internal workings of 
the machine would proceed by the manipulation of non- 
discursive symbols, while the only known means of 
communication with the machine would be via dis- 
cursive symbols, we could find out how the machine was 
arriving at its conclusions only by a long process of in- 
ference, and we could modify its basic patterns only 
through a gradual process of “training.” The patterns 
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would have enough internal structure so that new in- 
formation could be obtained by operations based upon 
content, rather than the external operations of sorting 
and propositional logic which present computers use. 
The symbolic operations, rather than the visible re- 
ceptor structure, would be the entities closely related 
to subjective meaning. For instance, we should not 
want to call the pattern on a television screen or the 
visual cortex subjective. These are patterns which are 
causal transformations of external objects. The con- 
struction of subjective meaning from these raw ma- 
terials must proceed in the ways which I have discussed 
in the first part of this paper, and in fact does proceed 
in that way in human beings; the fundamental units of 
perception being dynamic, unstable visual effects which 
are combined into a coherent image, and which may 
become dissociated in pathological states. Feeling is 
more closely related to these processes of combination 
than to nerve pathways, as we observe when we feel the 
texture of an object with another object such as a pencil 
held in our hand and tend to localize the meaningful 
feeling in the end of the pencil rather than in our hand. 

By consulting the references provided on psychology, 
it will be possible to learn how all the concepts I have 
discussed are embodied in examples of what I believe to 
be the only plausible and nontrivial psychological re- 
sults and hypotheses which actually deal with the 
mechanisms for the synthesis of meaning. Anyone inter- 
ested in the problem of intelligent machines should con- 
sult these references to see how one might go about 
building meanings on a basis of non-discursive symbols 
[l3]-[24]. 

Next I wish to discuss that aspect of subjective mean- 
ing that involves our feeling that the meaning should in 
some sense be self-contained — that the machine read 
its own dials and in some sense understand them the 
way we would. In this way the machine’s consciousness 
would be a collection of internal feedbacks which had 
some special feature. What is this feature? I think one 
aspect may be seen by means of an example. Goodman 
[32] uses a tick-tack-toe diagram to represent iso- 
morphically the pattern of hostilities existing among 
four gorillas in a zoo. The four lines stand for the four 
gorillas, and the four intersections stand for the four 
relations of hostility which happen to obtain. To us in- 
tersecting lines seem unlike gorillas so we say that the 
full meaning is not self-contained. On the other hand, 
our mental gorillas seem just like physical gorillas so 
we say that we contain the meaning. Since physical 
gorillas are known to us only by means of mental goril- 
las, of course they seem similar. For our present pur- 
poses we might say that the machine contains the 
meaning of what it perceives in case its internal dials 
register symbols whose complexity compares favorably 
with the complexity of our own ideas. 

This notion suggests a topic which I think is worth 
investigating. Reichenbach [26] remarks that it seems 
to be a general property of semantical systems that pre- 


cise concepts in the object language may arise from 
vaguer concepts in the metalanguage (the language 
which talks about the object language). Can we find 
languages embodying the complexity of our experience 
which are based upon a hierarchy of meta- and meta- 
metalanguages and so on down to a language so simple 
that it can be adequately embodied in the electronic 
units (or the physiological units) available to us? This 
possibility would be consistent with the psychological 
views which I have mentioned. 

The last aspect of subjective meaning that I wish to 
discuss is the one which will allow us to proceed from 
the terra incognita of the synthesis of symbolic forms 
to more welcome regions in which valuable resources 
have already been discovered. This is our well-known 
ability to become acquainted with causal relations and 
with the potentialities of things and actions, and to 
make appropriate generalizations and inductive in- 
ferences. This part of my paper involves the recent 
progress mentioned in connection with Polya’s pessi- 
mism regarding intelligent machines. This progress in- 
volves methods which were devised mainly with refer- 
ence to meaning and inference in the methodology of 
science. However, we can use them to handle the con- 
tent of our science, which involves just these problems 
of inference. Perhaps it will be best to list a substantial 
number of specific logical tools so that you may get a 
general idea of the scope of these methods and turn to 
the reference for details of any that sound interesting 
[25]-[28], 

The basic fact to remember is that all these methods 
deal with the weighing and balancing of judgments 
against a background of relevant material which in 
general is not explicitly expressed, and the resulting 
rules of logical manipulation differ considerably from 
the customary symbolic logic of isolated propositions. 
Among other differences, the validity of a statement 
will depend not only on what is presented but on how it 
is presented, with logically equivalent statements often 
not having equal validity. Validity will involve matters 
of past linguistic usage and thus the whole previous 
history of how the world has been described and antici- 
pated by use of the language. Thus these techniques 
deal realistically with common sense judgments in their 
full complexity of context and suggest that inductive 
judgments in machines may be a real possibility in the 
not too distant future. 

To begin the list we may consider a problem treated 
by Goodman [25]. This is the problem of contrary -to-f act 
conditional statements, or counter factual conditionals. 
These are statements such as “If I had struck the match 
it would have lighted,” which we regard as true, and 
“If I had struck the match it would not have lighted,” 
which we regard as false. Since we did not strike the 
match, and the match may not exist at present, these 
conditionals (which are thus contrary-to-fact) may not 
be directly verified, and the question arises as to what 
criterion we use to establish the truth of one and the 
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falsity of the other. The ability to judge what conse- 
quences would result from certain circumstances or ac- 
tions which are not actually present is an essential part 
of intelligent behavior. It turns out that the problems 
to be met and criteria to be formulated in judging 
counterfactuals suggest specific tasks that would have 
to be performed by components of intelligent machines. 

The first problem is to define the set of relevant con- 
ditions which are necessary for the truth of the counter- 
factual (oxygen, dryness, etc., in our example). We can- 
not include all true statements because many of these 
conflict with the antecedent of the conditional, since by 
definition it is a contrary-to-fact conditional. Here is 
another blow at simple association as an adequate basis 
for relevance. We need more structure. It turns out to 
be a difficult problem to specify in advance what type 
of information must be considered. We shall come to 
an application of this specification later. 

The next consideration is that not all conditionals 
may be used counterfactually, but only certain ones 
which express laws as opposed to mere facts. The struc- 
ture which distinguishes laws from facts turns out to 
have potential applications to structures in machines, 
and is moreover intimately connected with the rule that 
it is not the truth of laws but their inductive verifica- 
tion that makes them laws. Reichenbach in particular 
gives rules for the structure of laws that may be trans- 
latable into rules of structure and function in inductive 
computers. The problem of counterfactuals leads 
Goodman to the problem of establishing dispositions — 
the potentialities of things to behave in certain ways 
under certain circumstances — on the basis of behavior 
which has been actually manifested. The ability to em- 
body dispositions is another important feature of in- 
telligence. Now the class of objects having a given dis- 
position (e.g., flexible) is broader than the class of ob- 
jects having the manifest property (in this case, is 
flexed). The problem of extending the smaller class to 
the larger, which Goodman calls projection, leads him 
to a consideration of induction. The main problem here 
is that of choosing among the infinite variety of gen- 
eralizations consistent with given evidence, and this is 
another important feature of intelligence. Goodman and 
Reichenbach give rules for the resolution of conflicts 
between generalizations which seem as though they 
could be embodied in a conditional probability machine 
like that of Uttley, or else in models of the nervous sys- 
tem proposed by Hebb and others. The exact form of a 
proposition is important to its mode of confirmation 
since it is possible for the same evidence to confirm one 
proposition yet be irrelevant to the truth of a logically 
equivalent proposition [25] (“paradox of the ravens”). 

Reichenbach [26], [27] concentrates upon the formal 
properties of implication in ordinary and counterfactual 
conditionals and in laws which make them seem reason- 
able or unreasonable. It is his great achievement to have 
formulated rules concerning the detailed logical struc- 
ture of propositions which amazingly allow statements 


that seem reasonable to common sense and exclude most 
statements that seem unreasonable, despite the fact that 
the difference is one which seems most subtle and elusive 
and which depends upon the exact form in which the 
statement is expressed in addition to its extensional 
truth value. I shall select certain topics which will be 
applicable to computer design. His rules are generally 
such that they could probably be programmed for com- 
puters with no more trouble than any other program 
now employed. 

Reichenbach first discusses the rules for a form of 
implication (which he terms connective) which ex- 
presses necessary connections, and thus differs from the 
usual form of implication of symbolic logic (or adjunc- 
tive implication), which merely expresses the fact that 
certain conditions are adjoined. To know the truth or 
falsity of the adjunctive implication “if A then B” one 
must first ascertain the truth or falsity of “A” and of 
“B” and call the conditional true in case these truth 
values stand in a certain relation to each other, whether 
or not there is any natural connection between A and B. 
The connective implication, on the other hand, asserts 
a connection between A and B; it asserts that if “A” 
were true then “B” would be true, and we can decide 
whether such a connection exists without knowing the 
truth or falsity of “A” and “B.” Connective implications 
are involved in most reasonable inferences. 

Using his notion of connective implications, Reichen- 
bach is able to define a class of nomological statements, 
which correspond to laws of nature and logic, and a 
narrower class, the admissible statements, which cor- 
respond to ordinary reasonable statements. He dis- 
tinguishes three orders of truth, analytic, synthetic 
nomological, and factual, and is able to arrive at a 
characterization of reasonable statements based solely 
upon the orders of certain parts of the statements. 
Then he states a variety of rules for the transformation 
and combination of reasonable statements which differ 
substantially from the ordinary rules of symbolic logic. 
He proposes modified rules to cover what he terms 
semi-adjunctive implications, which make assertions 
about individual events with no claim that the same 
result will obtain upon repetition. 

With these rules, also conceivably adaptable to com- 
puters without too much trouble, we come to our last 
group of possible applications to intelligent computers. 

First we consider the problem of structuralization 
raised by the question of what are the relevant condi- 
tions. Practically everybody who has considered the 
problem of learning in computer or person considers the 
basic operation to be a certain kind of association whose 
inner structure is not known or else assumed to be 
simple. Sellars shows that if it is the case that the opera- 
tion of entailment is a simple connection, then unreason- 
able results can be avoided only if the entities considered 
in the judgments are classified into four types. The sys- 
tem must be able to distinguish among and deal with 
expressions for kinds of things, kinds of circumstances. 
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something done to a thing, and something it does in 
return. The descriptive function “x is F at time t” obeys 
different logical rules depending upon which category 
is represented by “ F. ” Even the phrase “at time t” 
means something different in each case. On the other 
hand we can simplify the entities and put all the com- 
plexity into the notion of entailment, which must then 
contain specifiable logical complexities. These considera- 
tions are of greatest importance in deciding upon basic 
operations adequate to produce reasonable judgments. 

Second, another form of counterfactual, the counter- 
factual of noninterference, states “ ‘A’ is true. Even if 
‘B’ were also true ‘A’ would still be true.” Although 
these are more complicated grammatically than the 
ordinary counterfactual, Reichenbach shows that the 
criteria for their validity require only notions of condi- 
tional probability. Hence they could be built into 
Uttley’s machine with only minor modifications. 

Third, Reichenbach shows that for many purposes 
we may use the ordinary adjunctive operations instead 
of the more complicated connective ones and not arrive 
at unreasonable consequences provided that we do not 
admit negations of lawful statements. When in this 
usage we wish to deny a connective implication, we 
must do so by saying “Do not use!” in the metalan- 
guage. This is another sense in which negations may be 
absent in addition to the frequently remarked absence 
of negations of presentational symbols. 

Fourth, Reichenbach discusses the important case in 
which some result follows lawfully from a factual state- 
ment. Although the result is a fact, and hence not nomo- 
logical or lawful, it is, however, nomologically derivable 
from a fact, and it is termed nomological relative to the 
fact. In the interesting case we use an elliptic form of 
speech which omits mention of the law. If this can be 
validly done, the relative nomological statements are 
called separable. These account for the important kind 
of reasoning which uses implications for counterfactuals 
even though they do not directly represent laws of 
nature. The rules of manipulation of these statements 
predict correctly that what we regard as reasonable 
logical operations may vary with the extent of our 
knowledge. Since in practice we never draw the bound- 
aries of the system under consideration so wide that 
all relevant information is explicitly included, the 
separable statements assume the greatest importance. 

Fifth, Reichenbach employs the term proper implica- 
tion to refer to a class of separable implications all taken 
relative to some convenient reference set of background 
information. A majority of our implications will be of 
this form. According to Reichenbach, the rules for the 
application of connective logical operations to these 
statements are so complicated that in general, in order 
to construct derivations one must use the simpler but 
less appropriate adjunctive calculus and then check the 
results to see whether they are reasonable. He concludes 
that the class of reasonable statements is thus not com- 
plete, in the sense that in order to construct derivative 


relations between members of this class we have to go 
beyond the class. It would in this sense appear that 
not only on the lowest levels of the synthesis of mean- 
ingful symbols, but at the highest propositional levels, 
too, reasonable statements may be produced only by a 
process of rejection of a multitude of lawfully produced 
but unreasonable statements. 

In summary, we have seen that present-day com- 
puters deal primarily with external relations among 
concepts which are given in a form that does not repre- 
sent their “inner structure.” In order for a machine to 
discover new patterns and to make inductive generaliza- 
tions, it appears that the following notions must be 
taken into account. 

The primary challenge to be met is that of forming 
impressions into logical elements. This involves Kant’s 
notion of the synthesis of a connected manifold of im- 
pressions— that is, the individual element must contain 
the schemata for its connections with other elements. 
No conceptual product can exist in isolation from all 
other conceptual products; rather, each concept must 
contain a partial representation of many other concepts. 
Thus, as Kant first noted, spatial and causal relations 
are not derived from experience; experience is derived 
from them — they are prerequisites for experience of 
certain types. The consequence for computer design 
is that one must not seek to build a machine that per- 
ceives and conceptualizes and then inserts into its con- 
cepts independently achieved schemata of spatial and 
causal relations. On the contrary, in order to perceive 
at all it must perceive according to schemata, and the 
spatial and causal schemata must be an inherent part 
of the percepts and concepts themselves. 

It appears that any activity of a computer rich 
enough in structure to deserve the term concept must 
be expressed in a language which is given meaning by a 
hierarchy of metalanguages whose initial members are 
simple enough to be organized rather directly around 
structures in the computer of a technological level which 
is perhaps within our present command. 

Much of the activity in the earlier levels of this hier- 
archy would then consist in operations upon non- 
discursive symbols, and in their development from a 
state in which the basic computer structures are repre- 
sented by a multitude of particular presentational 
symbols to a progressively articulated state amenable to 
the rules of discursive logic. In short, it seems that if a 
computer is to “think,” its concepts will have to undergo 
the process of development that Freud [13 ] termed the 
primary process. 

Finally, the logic, of inferences and inductive generali- 
zations made against a background of tacit premises and 
relevant information is quite different from ordinary 
propositional logic, but some of its rules are already 
known in a form which may be used at the present time 
in programming computers. A computer will probably 
have to arrive at reasonable inferences by selection from 
a larger set of inferences which are not all reasonable. 
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I hope that this selection of ideas from a mass of 
technical, logical, and philosophical investigations, and 
from certain psychological ideas not generally con- 
sidered in connection with computers, may help provide 
a program for computer research, and that bringing this 
program to your attention may be of some use in adding 
to the present capabilities of computers and control 
mechanisms. 
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Automatic Data Processing in the Tactical Field Army 

A. B. CRAWFORDf 


T HE use of digital computer techniques has become 
mandatory in the whole realm of ballistic missile 
design and guidance. And we are rapidly ap- 
proaching the era in which computer techniques sim- 
ilarly might well become a necessity in many phases of 
ground combat. The feasibility of applying these tech- 
niques in a tactical warfare environment has been estab- 
lished, and the Department of the Army has a high pri- 
ority program underway to place a prototype Automatic 
Data Processing System (ADPS) into its tactical or- 
ganization within the next four years. This paper will ex- 
plain the design objectives of this system and the prog- 
ress to date in the applications area, and describe the 
extension of several techniques required to make the 
system function. 

The technical design objectives may be summarized 
in about three key phrases — integrated, common-user 
service center, employing general-purpose hardware. 
The system in employment has been likened to the wide- 
spread tactical communication system, that is, a net- 
work of switching centers operated by specialists but 
tying together all users or customers, largely through 
the use of common-user trunks and switchboards. More 
specific design guidelines include the following: 

1) Common-user facilities with a minimum number 
of single-user processors; 

2) Integrated data files; 

3) Simple foolproof input devices; 

4) Interrogation-immediate reply; 

5) GP, building-block hardware; 

6) Automatic operation; 

7) Flexible output. 

The tactical ADPS will consist of several elements. 
(See Fig. 1.) Simple, foolproof input devices will be 
located at the source of data input. Connecting the in- 
puts to the data-processing center and centers with each 
other is the Army’s area communication system. Accord- 
ing to the work load and applications being processed, 
each center will employ one or more general-purpose 
computers. Finally, a whole range of output devices is 
available according to the type of output desired, or 
rather required, by the commander to facilitate his 
making a sound decision — volatile but rapid video-dis- 
play graphical or map-overlay form, or hard-copy re- 
ports. 


t Automatic Data Processing Dept., U. S. Army Electronic Prov- 
ing Ground, Fort Huachuca, Ariz. 
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Fig. 1 — The elements of the tactical ADP System are located in such 
a fashion as to minimize the burden on the users of System 
operation. 


Functionally, the system is portrayed as an integra- 
tion of subsystems according to the military staff func- 
tion mechanized. The Army’s traditional categorization 
is the well-known G-l, 2, 3, and 4 or Admin., Intelli- 
gence, Operations, and Logistics. In each of these cate- 
gories, however, the uses of information are similar — 
that is, immediate reaction, longer range planning, and 
historical preparations. Our system design must be 
tailored to meet the rigorous specifications of high speed 
for the immediate reaction use as well as the capacity 
for large amounts of file maintenance and processing. 

Looking at the over-all system configuration, one sees 
a far-flung complex of data processors and data trans- 
mission links, the magnitude of which is presently 
unique. As is well known, the tactical Army’s command 
structure runs from the Company headquarters to Bat- 
tle Group, Division, Corps, Army, etc. It appears desir- 
able — and in fact a necessity — to install a data-process- 
ing capability within each headquarters from Battle 
Group back. The physical size and processing abilities 
will of course increase as we proceed toward the rear of 
the Combat zone. In fact, at Corps and Army more than 
one computer will be linked together to make up that 
data-processing center. 

Now, everything described to this point might be 
called long-range planning. The title of the system dis- 
cussed is ARMYDATA, and the general time-frame is 
beyond the immediate future. Nevertheless, we are 
guided in the immediate systems developments by these 
objectives. The remainder of this paper will describe the 
Army’s project to attain a prototype ADP System as 
an intermediate step toward reaching the ultimate. 

Slightly less than two years ago, a major project was 
launched within the Army to place into operation as 
soon as possible a system which could support the ever- 
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increasing military requirements for mobility and ef- 
fectiveness in combat. Because of the long lead-time 
inherent in the R & D cycle for complex digital com- 
puters, the decision was made to parallel the develop- 
ment of the system’s hardware and the system’s con- 
cepts and applications. (The full description of the so- 
called FIELDATA family of equipment has been left 
to the companion paper prepared by Captain Luebbert 
of the Signal R & D Laboratories.) 

At the time the decision was made to proceed on the 
hardware, a plan of action was laid out whereby the 
entire Army’s methods for information handling in the 
field were to be scrutinized in great detail. The ultimate 
objective was a detailed description of each user’s data- 
processing requirement with the more immediate by- 
product of streamlining the manual methods. 

Those persons associated with business data-process- 
ing systems are familiar with the peculiarity of the sys- 
tems problems involved in mechanizing integrated 
business procedures of any magnitude, and here I use 
the word “system” to mean “procedural” structure. We 
propose to integrate some 50 or more separate pro- 
cedures (for as many customers) from concepts de- 
veloped initially in an isolated fashion. That is, the 
logistics applications are being derived through systems 
analysis by the logistics specialists, the combat intel- 
ligence applications by G-2 specialists, and so on. To 
take full advantage of the great power of automatic 
data processing, we must mold or integrate these indi- 
vidual requirements into a realistic yet imaginative 
working system which meets the needs of the diverse 
staff requiring its services. The types of applications to 
be satisfied again can be broken down into the tradi- 
tional areas of Operations, Intelligence, Logistics, and 
Admin. A different categorization according to data- 
processing techniques might be Combat Control (ran- 
dom access), Combat Support (batched processing), 
and Combat Computation. 

Approximately twenty separate study groups are an- 
alyzing some seventy separate potential ADP applica- 
tions. To assure that these studies will be more directly 
usable, an instructional booklet has been distributed 
setting forth a very rigid report format and content. 
The fact is that the area specialists are not experienced 
in the techniques of systems analysis, so we designed the 
booklet more or less to lead the study group “by the 
hand” through the steps of an ADP application study or 
systems analysis. Further, through contracts and the 
use of our own systems analysts, we are providing tech- 
nical assistance to the study groups, particularly as 
they reach the phase of proposing an automated pro- 
cedure for their problem. 

Upon completion the individual study reports are sub- 
ject to a technical review to determine feasibility, com- 
pleteness, and the amount of dependence on other ap- 
plications. Analysts return to the agency which con- 


ducted the study to fill in details and to eliminate am- 
biguities. The object of the next step is to define the 
application to the detail required for programming and 
to derive certain workload data to be used in the over-all 
system design. 

About this time, too, the preparation of digital models 
of the system and its parts is to begin. This, I believe, 
implies the underlying approach to our whole problem: 
simulation! We shall be utilizing all means and all levels 
of digital simulation techniques. First of all, interpre- 
tive simulation must be employed to permit the prep- 
aration and checking out of the computer programs 
prior to the delivery of the militarized computers. A 
successful simulation of the MOBIDIC on the IBM 
709 has been completed, and so far we have put through 
successfully a few math subroutines, an intelligence 
filing and retrieval experiment, a combat surveillance 
target acquisition simulation, and a limited payroll 
problem. To extend this technique we are attempting 
now to develop a simulator generator which will enable 
us to experiment inexpensively when altering machine 
parameters or order codes. (This latter is an attempt 
to lay the groundwork for the later definition of the 
advanced system characteristics and computer de- 
signs.) 

Before the detailed design of the prototype system is 
launched, a subset of the available application areas 
will be selected for implementation. Only then can we 
say specifically how comprehensive the FIELDATA 
system will be in covering tactical procedures. Since it 
is recognized that the hardware cpmponents are experi- 
mental in nature, in order to meet a tight schedule with- 
out a crash program the FIELDATA system will itself 
be experimental in nature. 

Parallel to the detailed programming of each applica- 
tion will be the use of a digital model of the system in- 
formation flow to permit a prediction of the needed data 
rates, alternate routing, and potential bottlenecks. A 
model is being used now using the 709 which simulates 
a general army area communications complex. Prob- 
ability distributions and Monte Carlo techniques are 
employed throughout, from the preparation of a mes- 
sage entering the system to human switchboard operator 
actions and to message processing as it progresses 
toward its destination. A statistical analysis routine 
produces model-run data reduction and also recom- 
mends a sample size according to our level of confidence 
desired. 

To explain the next type of simulation, first I shall 
outline the organization of the ADPS Test Facilities at 
the Proving Ground. Computer test facility is around a 
large-scale computer center, specifically the IBM 
709. Herein all of the simulation work will be conducted, 
and herein will be the means for conducting controlled 
environmental tests for predicting the validity of our 
proposed computer procedures. The field-test facilities 
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are for the purpose implied by the name and will nat- 
urally be employed to try to prove out the simulation 
and model results. Even during these field-test phases 
we are to utilize simulation techniques. In this context 
the 709 will be linked to that part of the system being 
subjected to evaluation in a field operational environ- 
ment; a single thread employment of equipment will be 
supplemented by simulation of the remainder of the 
system. (See Fig. 2.) That is, the computer will provide 
the data sink to introduce input into the system and to 
absorb output from these echelons actually being op- 
erated. 

The Computer Center became operational in Feb- 
ruary, 1959. The larger part of the application studies 
are completed, and detailed analysis has begun on 
several of them with demonstration runs already made 
on at least two. The major hardware items of the proto- 
type system are on order, with the first to be delivered 
this coming Fall. Combining this progress with that 
reported by Captain Luebbert on hardware, transmis- 
sion, and programming aids we remain confident that 
our objectives can be attained. 

In summary, this paper has reported on an ambitious 
and futuristic program undertaken by the Signal Corps 
to provide the Army with a vast tactical Automatic 




Fig. 2 — The IBM 709 will serve as a source and destination of sys- 
tem input and output by simulating the missing echelons during 
field tests. 


Data Processing System. The proposed system in proto- 
type form is to be operational by 1963 and will incor- 
porate the very latest developments in digital tech- 
niques, i.e., new miniaturized general-purpose data- 
processing devices, computer-to-computer communica- 
tions, and automatic programming. The research ef- 
forts in this project, and certain standards derived, are 
bound to have an effect on and contribute to related 
commercial data-processing activities. 


Data Transmission Equipment Concepts 

for FIELD AT A 

W. F. LUEBBERTf 


I TIELDATA is an integrated family of data process- 
I ing and data transmission equipment being de- 
veloped for Army use. A unique feature of this 
family is the almost complete disappearance of conven- 
tional distinctions between communications and data 
processing. This paper deals primarily with the concepts 
and techniques developed to create this evolutionary 
merger emphasizing the ways in which conventional 
communications concepts have been adapted to achieve 
a high degree of interoperability with computers and 
other data processing equipment, and an extraordinary 
degree of flexibility and adaptability of application. 

In order to explain and illustrate the FIELDATA 
concepts, this paper makes extensive use of specific ex- 
amples of design decisions, particularly those dealing 
with common features such as codes, voltage and im- 
pedance levels, data rates, etc. Among the equipments 

f U. S. Army Signal Res. and Dev. Lab., Fort Monmouth, N. J. 


of the FIELDATA family developed in accordance with 
these concepts and common standards are the following, 
all of which are scheduled for completion prior to the 
end of 1960: the MOBIDIC computer (Sylvania), the 
BASICPAC and LOGICPAC computers (Philco), the 
AN/TSQ-35 19,200 bit/second data transmission equip- 
ment (Bendix- Pacific), the AN/TSQ-33 2400 bit/ 
second data transmission equipment (Collins), the 
AN/TSQ-32 1200 bit/second data transmission equip- 
ment (Stelma) , the DATA COORDINATOR, a facil- 
ities coordination and control equipment for an inte- 
grated communications and data processing system 
(IBM), and a host of miscellaneous equipments such as 
magnetic tape transports (Ampex), a flexowriter-like 
electric typewriter (Smith- Corona), high-speed print- 
ers (Anderson-Nichols), security equipment (Collins), 
etc. 

The fundamental capabilities of data processing 
equipment can be described as the ability to transform 
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information into more desirable or useful forms. Using 
the same viewpoint, the fundamental capability of con- 
ventional communications equipment can be described 
as the ability to transfer information to a more desirable 
or useful location. Of course, from an abstract logical 
viewpoint a transfer is merely one special kind of trans- 
formation, one in which only physical location is 
changed. Combining these descriptions, one is led to the 
concept of a generalized data handling system capable 
of performing generalized transformations, of which 
conventional one-location data processing would be one 
special case and conventional no-processing data trans- 
mission would be another special case. In such a system 
there would be no fundamental distinctions between 
data processing and data transmission, but only dis- 
tinctionsof convenience based upon application, use, and 
design emphasis. 

Is such a concept a reasonable one, and does it have 
any practical utility? The answer to the first part of this 
question will be examined in detail in this paper; the 
answer to the second part will be determined by the suc- 
cess in actual use of the equipment and system concepts 
it generates, the concepts explained in this paper which 
are receiving initial implementation in the FI ELD AT A 
family of equipments. 

An examination of the information flow and manipu- 
lation in typical data processing and data transmission 
equipments shows almost immediately that there is a 
very considerable mixture of functions going on in both 
types of equipment. A very considerable amount of the 
activity going on inside any computer or data processor 
consists of simple transfers of information from one part 
of the processor to another. Within data processing 
equipment a major part of the activity is concerned with 
the generation, manipulation, and other processing of 
information used for control, supervisory, and error re- 
duction purposes. In many cases nearly identical opera- 
tions go on in both data processing and data transmis- 
sion equipments, with the differences, if any, being mat- 
ters of design emphasis based upon application and use. 

Many practical cases of this similarity are immedi- 
ately obvious, particularly in the area of devices used 
for data entry and output. For example, computers fre- 
quently use paper tape readers similar to those used in 
teletypewriter transmitter distributors, and paper tape 
punches that could easily be used in teletypewriter 
reperforators. Similarly the idea that kinds of input- 
output devices such as card readers and punches and 
magnetic tape transports which are widely used for data 
processing can effectively be adapted for use of com- 
munications lines is also being exploited in equipments 
such as the IBM transceiver, the Collins Kinetape, etc. 

FIELDATA emphasizes this kind of exploitation to 
the extreme, particularly encouraging it by assuring 
that interconnection of semiautonomous equipment 
modules be made in accord with common standards 
without distinction whether these equipments are con- 
ceived primarily for computer-associated or transmis- 


sion-associated functions. This makes it possible for the 
same data terminals to operate not only with data proc- 
essing-type inputs and outputs such as computers, paper 
tape, magnetic tape, and IBM cards; but also for it to 
operate with real time weapons system data and with 
telegraphic data. 

Control circuitry is so devised that pure binary as 
well as alphanumeric ( alphabetic-numeric ) data may be 
handled. Since any digital code, be it Baudot (teletype- 
writer) code, Holerith (IBM card) code, or any of a 
wide variety of computer codes may be represented in 
binary bit-by-bit form, the FIELDATA devices have 
the potential of transmitting or handling any type of 
digital data. Thus, they could be used with digitized 
voice, digitized facsimile, or other types of digitized an- 
alog signals. 

The use of common standards, codes, and standard 
data rates makes possible the kind of data transmission 
equipment concept shown in Fig. 1. This concept leads 
naturally to a division of the subassemblies of data 
transmission equipment into three kinds: 

1) Input-output transducers are devices for convert- 
ing information from some human or machine us- 
able form such as paper tape, magnetic tape, 
punched cards, analog electrical voltages, strokes 
on a keyboard, etc., into digital form. 

2) Transmission transducers are devices for convert- 
ing data in digital form into appropriate signals 
for transmission over radio, wire or other kinds of 
propagation media. 

3) Embolic 1 equipment , normally inserted between' 
input-output transducers and transmission trans- 
ducers, is used primarily to perform control and 
supervisory functions, error detection and/or cor- 
rection, buffering, and/or speed conversion, code 
conversion, or encryption necessary for proper 
system operation of the data transmission equip- 
ment. The functions of embolic equipment are in- 
formation processing functions. Inputs and out- 
puts will both be digital in form, although supple- 
mentary analog information may also be available 
particularly in some kinds of error control 
schemes. A general-purpose computer is poten- 
tially a very powerful and flexible type of embolic 
equipment, but the necessary functions can often 
be performed much more economically by spe- 
cialized equipment. 

This division of subassemblies may be a physical divi- 
sion into separate items of equipment, into semi- 
autonomous parts of a single equipment (either in a 
separate box or in a single box) or it may be merely 
conceptual with no physical implementation. 

1 Embolic is a coined term from the Greek embolisimos meaning 
to put between or insert. This word is also used in medical, astronomic, 
and ecclesiastic literature to describe other specific kinds of inter- 
calations. 
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ANY OF M ANY OF N 

INPUT OR OUTPUT DEVICES TRANSMISSION DEVICES 

CHOSEN TO MEET USER CHOSEN TO MEET COMMUNICATIONS 

DATA FORM REQUIREMENTS MEDIUM AND PROPAGATION REQ’M’TS 


paper tape reader or punch 
magnetic tape transport 
card reader or punch 
electric typewriter 
printed character reader 
high-speed printer 
computer input or output 
etc 



COMMON LANGUAGE 
EQUIPMENT 
for 

error control 
supervision, 
cryptography, 
etc. 



amplitude modulation sequential transmission 
frequency-shift keying sequential transmission 
phase reversal keying sequential transmission 
frequency-shift keying multiple subcarriers 
phase quadrature modulation multiple subcarriers 
etc. 


Fig. 1 — FIELDATA equipment modularity concept. 


Obviously maximum flexibility and adaptability can 
be achieved if any input-output transducer can operate 
with any transmission transducer. This permits one to 
tailor equipment to meet the requirements of a particu- 
lar situation. It permits one to choose the in-out device 
suitable for the user’s most convenient data form (paper 
tape, magnetic tape, IBM cards, etc.) independently of 
the nature of the transmission facility available. One 
can then choose the transmission transducer on the 
basis of the transmission medium used (VHF radio re- 
lay, HF radio, loaded cable, carrier transmission on wire, 
etc.) independently of the user’s data form. Then join 
the two together taking advantage of common stand- 
ards of interconnection or intercommunication between 
modules to create a well-tailored combination. 

The question then arises, “Why embolic equipment?” 
Certainly by proper choice of common interconnection 
or intercommunications characteristics one can mini- 
mize requirements for code conversion, buffering, speed 
conversion, etc. Ideally one should be able to join the 
input-output transducer to the transmission transducer 
without embolic equipment, so why have it? The answer 
is that there are three important functions in a data 
terminal which seem convenient to separate from both 
in-out transducers and transmission transducers: 1) 
communications supervision, 2) error control, 3) cryp- 
tographic security. 

Supervisory functions are conveniently separated 
from the input-output transducers and the transmission 
transducers because supervisory requirements may be 
strongly influenced by both. Error control requirements 
may also be strongly influenced by both the in-out data 
"and the transmission situation, and may even in some 
cases not be desired at all. Thus it is best treated as a 
module which can be tailored to the paired require- 
ments of in-out and transmission or completely omitted. 
Cryptographic security is conveniently a separate 
module so that it may be omitted in those cases where 
security is not required or desired. 


In order to discuss supervisory activities conveniently 
it is desirable to make a distinction between two kinds 
of information which flow through a communications 
system : 

1) Primary information is that which a user wishes 
transferred to another location, that is, the in- 
formation to be communicated. 

2) Secondary information is added to the primary 
information either by the originator or by equip- 
ment of the communications system which is used 
to perform functions of supervision, routing, error 
control and related activities necessary or desirable 
to permit the primary data to be effectively com- 
municated. This information is used by communi- 
cations equipment and personnel and is normally 
of no use or interest to the ultimate recipient of 
the primary information. 

A basic requirement for maximum flexibility and 
adaptability is that the user, who enters data at an input 
device and receives data at an output device, need not 
be required to exercise judgment or knowledge, to per- 
form special activities, to use or to interpret secondary 
information. This, in turn, makes it desirable to isolate 
secondary information from the input-output devices, 
which are the user’s point of contact with the communi- 
cations system. A convenient way of establishing and 
enforcing this isolation is the creation of a distinct em- 
bolic module which generates, receives, interprets, and/ 
or acts upon secondary information and passes on ac- 
tion requirements derived from this secondary informa- 
tion over local control lines to its associated input-out- 
put transducer and transmission transducer. It is im- 
portant to note that such equipment need not have any 
ability to interpret or act upon primary data, and thus 
its operation can be made completely independent of 
the coding used for primary data so long as there is a 
unique method of distinguishing primary from sec- 
ondary data. 
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In the FI ELD AT A family of equipments the segrega- 
tion of primary and secondary data is accomplished in a 
particularly simple way by providing separate transmis- 
sion symbol “alphabets” for primary and secondary 
data. The basic unit of data in FIELDATA codes is a 
six-bit character. For primary data this may be merely 
a six-bit block with no specific meaning assigned to spe- 
cific bits within it, and no restrictions on the permissible 
code combinations which may be transmitted. If this 
information is alphanumeric in form so that specific bit 
meanings must be assigned for electric typewriters, 
printers and similar input-output transducers to oper- 
ate, then the FIELDATA alphanumeric code given in 
Appendix I is used. A similar but distinctly different six- 
bit FIELDATA supervisory data code has also been 
created and is outlined in Appendix II. Message head- 
ing and addresses, dialing information, multiplexing in- 
formation, signals indicating start and end of message 
blocks, and control signals are all created using this 
alphabet. 

In many situations, for example inside computers, it 
is desirable to both the six-bit primary data alphabet 
and the six-bit FIELDATA supervisory code alphabet 
with implicit differentiation between them similar to 
the implicit differentiation between data and instruc- 
tions in the computer. However, other situations arise 
where it is desirable to provide an explicit identifying 
tag to specify which alphabet is being used. 

A basic tagging method has been adopted for use on 
interconnecting cables employed for joining equipment 
modules. A seventh or tag bit is added to each six-bit 
symbol: a binary “one” if the symbol is primary da- 
ta, and a binary “zero” if the symbol is from the 
FIELDATA supervisory alphabet. In the interconnect- 
ing cables an eighth bit in the form of an odd parity 
bit is also added to provide error protective redundancy. 
This basic eight-bit form creates the appearance of an 
eight-bit code. It has widespread use, but cases do exist 
where other alphabet tagging techniques and other 
redundancy is preferable. For example, serious compli- 
cations would arise if this basic eight-bit form were used 
for the actual punching of FIELDATA information on 
paper tape. This occurs because control difficulties oc- 
cur with a paper tape punch-reader system when pri- 
mary data symbols are allowed to use either the blank 
tape condition (no channels punched) or the deleted 
tape condition (all channels punched). Since the use of 
the basic eight-bit form inevitably results in one or the 
other of these symbols being a primary. data symbol 
depending upon whether a hole is interpreted as a 
binay “zero” or as a binary “one,” a slightly different 
tagging and parity scheme must be adopted. This, of 
course, results in another eight-bit form which could 
be interpreted as an eight-bit code. Other tagging and 
redundancy schemes could lead to other apparent codes. 
Thus, there is no such thing as a unique eight-bit 
FIELDATA code, although it may be convenient to 
think of the basic eight-bit form as an eight-bit code. 


In some situations transmission errors may create 
important reasons for using other tagging and error pro- 
tective redundancy schemes. Errors which cause loss or 
change in supervisory information such as dialing in- 
formation, message headings, start and end of message 
indications, and so forth, may completely disrupt the 
proper functioning of a communications system. Thus 
they may require protective redundancy many times 
more powerful than the simple parity used in the basic 
eight-bit form. However, compared to these secondary 
data, primary data may be capable of tolerating consid- 
erably higher error rates. For example, if the data are 
English text the inherent redundancy of the language 
may permit significant corruption without loss of intelli- 
gibility. Numerical data may consist of successive ob- 
servations of a physical phenomenon in such a form that 
inconsistent data may be deleted and ignored, or they 
may be protected by numerical checks similar to those 
used by accounting systems to detect bookkeeping er- 
rors. In a situation where error rates were severe it 
might not be desirable to apply as powerful error con- 
trol to these less demanding primary data as to sec- 
ondary data; thus one might desire to use methods of 
differentiating between primary and secondary data 
which are more resistive to corruption by errors than the 
simple tags used in the basic eight-bit form together 
with different error control schemes for primary and 
secondary data. 

This kind of differentiation illustrates a fundamental 
assumption about error control which is part of the 
FIELDATA concepts. Specific error control require- 
ments should be determined by the nature of the data 
and their use. As mentioned above there may be cases 
where primary data can tolerate frequent errors with 
little loss of usefulness, but there are other cases, such 
as the transmission of computer programs, where very 
low error rates are required. In some cases if errors 
above the desired maximum rate occur and are detected, 
the erroneous information may be deleted without sig- 
nificant harm; in other cases they must be corrected. In 
some cases where correction is required it may be de- 
ferred and handled by a service message ; in other cases 
it must be corrected before the data are released, and so 
forth. 

Notice that all these requirements are determined by 
the use of the data, and that these demands remain the 
same regardless of the transmission path and types of 
transmission equipment through which the data may 
be required to flow. However, the occurrence of errors 
is anything but independent of transmission factors. Al- 
though errors do occur in input-output devices and 
embolic equipment, by far the most variable and diffi-* 
cult to control errors normally occur in the transmission 
portion of a data link. These errors are often quite vari- 
able in frequency and interrelated in occurrence. They 
are quite strongly dependent upon the nature of the 
propagation medium and the kinds of disturbances to 
which it is subject. For example, an HF radio link sub- 
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ject to severe fading and multipath disturbances could 
hardly be expected to have the same error problems as 
a wire and cable link subject to intracable crosstalk and 
impulse- type switching noise. Furthermore, for a given 
propagation medium and kind of noise and disturbances, 
the frequency and interrelationships of errors are 
strongly dependent upon the characteristics of the 
transmission transducers used. For example, if ampli- 
tude modulation is used one would expect different 
errors than if frequency shift keying were used; if 
sequential transmission of short bauds on a single sub- 
carrier is used one would expect different error problems 
than if parallel transmission of long bauds on multiple 
subcarriers is used; and if sampling or nonintegrating 
types of detection are used one would expect different 
error problems than if full integrating detection schemes 
were used. 

The number, variability, and difficulty of measure- 
ment and analysis of the various factors which con- 
tribute to the frequency and interrelationships of trans- 
mission errors or particular circuits is staggering. At the 
present time the state of the art is such that only crude 
estimates of frequency of error can be made for typical 
equipments when exposed to disturbances other than 
Gaussian noise, and that practically nothing can be esti- 
mated in advance about the interrelationships of errors 
under practical conditions of impulse noise, crosstalk, 
propagation variations, etc. This is particularly unfor- 
tunate because the effectiveness of the various digital 
error control schemes available is strongly dependent 
upon the interrelationships of errors. For example, a 
simple parity check is capable of detecting single errors 
but not double errors. If errors occur randomly, double 
errors will seldom occur and this very simple check will 
be quite powerful. Thus if the bit error rate is 10~ 4 , a 
simple parity check will reduce the undetected error 
rate to 10~ 8 . On the other hand, if errors tend to be 
clustered a parity check will be rather ineffective. Thus 
if the conditional probability of a second error immedi- 
ately following the first is 0.5 and the bit error rate is 
10 -4 , then a parity check will reduce the undetected er- 
ror rate to only 0.5X10 -4 . Given knowledge of the in- 
terrelations, checks can be designed which give high 
protection with a minimum amount of checking equip- 
ment. Unfortunately this knowledge is usually unavail- 
able. 

If the most important sources of errors are in a data 
communications link, is it proper to incorporate the 
major error control features of the link into the trans- 
mission transducers? The FIELDATA concepts answer 
this question with a resounding NO! Why? The key rea- 
son is that while the occurrence of errors and the raw 
error rate and characteristics are determined primarily 
by transmission factors, the error requirements are deter- 
mined by the use of the data. The means and techniques 
of error control appropriate to a particular situation ob- 
viously depend upon the interaction of these factors. 
However, it is a fundamental modularity principle of 


FIELDATA that the characteristics of transmission 
transducers should as nearly as possible be independent 
of the details of user input-output characteristics and 
data employment. 

If one were to incorporate the error control features 
into the transmission transducers and one desired to 
provide p classes of controlled error service to users with 
q modulator/demodulator assemblies, then one would 
require p times q types of complete transmission trans- 
ducers. The obvious answer is to separate modularly, 
making the error control module an item of embolic 
equipment. This allows one advantage of similarities in 
requirements and raw error characteristics among the 
different situations to reduce the variety of equipment 
to be constructed. 

In view of the present difficulties in predetermining 
the specific error characteristics of transmission trans- 
ducers prior to construction and test, it also permits 
construction of new transmission transducers and their 
use with existing error control embolic equipments until 
the specific error characteristics of the transducer can 
be measured and new error control embolic equipment 
designed if necessary to meet user requirements with the 
measured transmission error characteristics. 

In addition to these practical advantages of placing 
error control responsibilities in embolic equipment 
modules rather than in transmission transducer mod- 
ules, there are conceptual advantages associated with 
maintaining the simplest possible information flow 
patterns and division of activities among the three basic 
kinds of assemblies. 

In general transmission, transducers pay no attention 
to the information content of the digital information 
they convert to modulated transmission form, neither 
knowing nor caring whether the data are primary or 
secondary, whether they are redundant or irredundant, 
or what code or codes they use. In contrast to this, em- 
bolic equipments normally act as information processing 
devices. Thus, in supervising a transmission link em- 
bolic supervisory equipments act on sensory informa- 
tion received from transmission transducers and in-out 
transducers and generate, process, and interpret sec- 
ondary supervisory information to control the over-all 
operation of the communications link. The error con- 
trol problem is exactly parallel. Acting on information 
about user requirements from the in-out transducer 
side, and information and sensory information about 
transmission errors from the transmission transducer 
side, error control equipment is required to generate, 
process, and interpret error control information using it 
to control (often via supervisory operations) the over- 
all operation of the communications link in such a way 
as to control its errors. Thus it is obvious that from 
the information processing viewpoint the performance 
of error control as an embolic function has a close 
parallel to other embolic functions and is distinctly dif- 
ferent from the functions otherwise performed by trans- 
mission transducers. 
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Some of the FI ELD AT A equipments being developed 
in implementation of these concepts are listed in Table I. 

The transmission transducers of the FIELDATA 
family are limited in number. However, the choice of 
the 75X2” pattern of data rates permits widespread 
augmentation by minor modification of existing or de- 
velopmental teletypewriter multiplexed transmission 
equipments. In addition, future expansion is simplified 
by the fact that future equipments will be able to 
utilize the same embolic and input-output equipments, 
and will thus be cheaper to develop than data transmis- 
sion equipments which require development of embolic 
and/or input-output equipment as part of the same 
package. Expected future expansion will place greater 
emphasis on transducers for radio circuits. 

TABLE I 


Transmission Transducers 


AN/TSQ-32 

AN/TSQ-33 

AN/TSQ-35 

600-1200 bit/second 
600-2400 bit/second 
19,200 bit/second 

Stelma Corp. 

Collins Radio 
Bendix-Pacific 

General Purpose Computers 

MOBIDIC 

BASICPAC/LOGICPAC 
INFORMER/DATA COORDINATOR 

Sylvania 

Philco 

IBM 

In-Out Transducers 

Electric Typewriter 

Paper Tape Reader 

Paper Tape Punch 

High-Speed Printer 

Paper Tape Transport 

Magnetic Tape Transport 

Tacden 

Smith- Corona 
Smith-Corona 

Smith- Corona 

Anderson Nichols 
Ampex Corp. 

Ampex Corp. 
Aeronutronics Systems 

Special Embolic Equipment 

CV-689 

CV-690 

CV-691 

Cryptosecurity Adaptor 
Control Equipment 

Data Concentrator 

Collins Radio 

Collins Radio 

Collins Radio 


The AN/TSQ-32 is a transmission transducer capa- 
ble of accepting data from a standard FIELDATA 
connection and of transmitting it serially as frequency- 
shift modulation of a single subcarrier over a standard 
voice channel. Transmission rates are 1200 bits/second 
(1500 words per minute) over good quality circuits, 
and 600 bits/second over poorer circuits. 

The AN/TSQ-33 is a transmission transducer capa- 
ble of accepting data from a standard FIELDATA con- 
nection and of transmitting it as 8 channels as synchro- 
nous phase quadrature modulation of four subcarriers 
over a standard voice channel. Transmission rates are 
2400 bits/second (3000 words per minute) over good 
quality circuits, with 1200 and 600 bits/second avail- 
able for use over poorer circuits. This equipment is 
essentially a militarized, miniaturized, FIELDATA 
compatible version of the Kineplex TE-206. 

The AN/TSQ-35 is a transmission transducer capable 
of accepting data from a standard FIELDATA connec- 
tion and of transmitting it by amplitude modulation of 


8 subcarriers orthogonally spaced in frequency and 
time. The transmission baseband is the 48-kc band be- 
tween 12 and 60 kc used for military and commercial 
cable and carrier circuits. Representative circuits are 
the Army spiral-four and associated AN/TCC-7, 8, and 
11 equipments; and commercial types N and K carrier 
equipments. In addition to point-to-point full duplex 
operation, this equipment has special features for multi- 
ple station “common net” round-robin type operation. 

In addition to these transmission transducers spe- 
cifically designed for FIELDATA a wide variety of ex- 
isting military and commercial teletypewriter transmis- 
sion equipments can be used for FIELDATA transmis- 
sion with only minor modification. Examples of such 
equipment are the AN/FGC-54 capable of transmission 
and diversity reception of FIELDATA information at 
2400 bits/second over long-haul 3-kc radio circuits using 
32 channels each operating at 75 bits/second. Another 
example is the AN/FGC-29 potentially capable of 
transmission and diversity reception FIELDATA in- 
formation at 1200 bits/second using 16 channels each 
operating at 75 bits/second. Yet another example is the 
AN/TCC-30 potentially capable of transmitting 1200 
bits/second using 16 channels operating at 75 bits/ 
second. 

The FIELDATA computers form the most complete 
and well-balanced portion of the FIELDATA family. 

The FIELDATA computers may act as either input- 
output transducers or as embolic equipments for data 
transmission purposes, having the ability to accept, 
process, and emit either primary or secondary data. All 
are designed for direct operation with data transmission 
equipment. In their employment the MOBIDIC, which 
was the first machine to have its characteristics frozen, 
is the least capable of transmission of machine data; it 
has serious restrictions on its use of the supervisory code 
functions. The most flexible in its employment of data 
transmission will be the DATA COORDINATOR, a 
newer equipment which will have the capability of ter- 
minating a large number of data transmission circuits si- 
multaneously and which will have a number of special 
capabilities and console positions to facilitate its use as 
a facilities coordination processor for an integrated com- 
munications/data processing system. 

All the FIELDATA computers are general-purpose 
processors of modular design and great flexibility. Their 
relative computation speeds are shown on the bar chart. 
All are designed for field use with operation and main- 
tenance simple enough for field personnel. The largest, 
MOBIDIC, mounts in a semitrailer van, while the 
others mount in shelters which can be carried on a 
truck. It is interesting to note that the minimum assem- 
bly which forms a fully operational stored program 
computer of the BASICPAC/LOGICPAC type or the 
INFORMER/DATA COORDINATOR type weighs 
in either case less than 150 pounds subdividable into 50- 
pound or smaller packages. However, these central proc- 
essors are normally used for data processing purposes 
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in vehicular assemblies which far exceed these weights 
because of auxiliary equipments such as multiple tape 
transports, special real-time input-output units, addi- 
tional magnetic core memory modules, data transmis- 
sion equipment, input-output converters, etc. For ex- 
ample, a vehicular BASICPAC would augment the 
central processor with four magnetic tape transports, a 
high-speed paper tape reader and punch, an AN/TSQ- 
33 transmission transducer, and other auxiliary equip- 
ment. Fig. 2 indicates the relative computational capa- 
bilities of the various FIELDATA processors. 



Fig. 2 — -Speeds of computation, FIELDATA processors. 


The input-output transducers of the FIELDATA 
family serve double duty as computer input-output de- 
vices and as data transmission input-output devices. 
The group under current development constitutes a 
minimum group of general usage items, a number of 
which are in only partially militarized form. This mini- 
mum group will be augmented by future field teletype- 
writer equipments which will utilize FIELDATA code, 
and by advanced equipments now under study to pro- 
vide specialized input-output capabilities. Since the 
items are mostly quite conventional, detailed descrip- 
tions are omitted here in order to save space. 

The specialized embolic equipments in FIELDATA 
provide cryptographic security, interconnection of in- 
put-output and transmission transducers, and tie-in of 
FIELDATA circuits to existing teletypewriter circuits. 

Three major items of special embolic equipment are 
being developed for FIELDATA. The CV-689 is a spe- 
cial cryptosecurity adaptor which permits an existing 
type of security equipment to be inserted just before the 
transmission transducer in any FIELDATA data trans- 
mission assembly, thus providing cryptographic secu- 
rity. 

The CV-690 is a device which provides supervisory 
control, error control, and synchronizing buffer facilities 
for connecting paper tape or magnetic tape units to the 
AN/TSQ-32 or 33. Although future plans call for sim- 
ilar special militarized embolic equipments for other 
kinds of input-output equipment such as card equip- 


ment none is now under development. However, rather 
minor modifications of commercial Collins Kinecard 
equipment will permit nontactical employment of 
AN/TSQ-33 equipment for card transmission, and at 
least some versions of the MOBIDIC will include card 
equipment which, through the computer, can reach 
transmission facilities. 

Although FIELDATA concepts make provision for a 
wide variety of error control and supervisory control 
systems only the particular system used in the CV-690 
will actually be used initially except for computer-to- 
computer transmission, since other embolic devices us- 
ing different error control or supervisory control schemes 
are now under development. It is expected that when 
experiments determine the actual frequency and inter- 
relationship of errors for particular transmission trans- 
ducers, more effective schemes will be devised. However, 
the very simple and easy to implement two-dimensional 
(interlaced) parity error detection scheme followed by 
request-back or rerun-type error correction used in the 
CV-690 is particularly easy to implement, and is ex- 
pected to suffice for initial testing. 

The CV-691 Data Concentrator is a device designed 
to bridge the gap between existing large-scale 60- and 
100-wpm teletypewriter facilities and FIELDATA 
equipment. Although all FIELDATA processors have 
normal provision for a paper tape reader which accepts 
teletypewriter tape as well as one for FIELDATA tape, 
there exists a significant need, especially at locations 
where FIELDATA processors might not be available, 
to accept multiple channels of teletypewriter informa- 
tion and convert it into FIELDATA form to take ad- 
vantage of FIELDATA transmission transducers and 
error control, permit recording on FIELDATA mag- 
netic or paper tape, permit printing on FIELDATA 
high-speed printers, simplify entry into FIELDATA 
computers, etc. The CV-691 accepts up to 25 (or 50) 
teletypewriter inputs, stores the information in a buffer 
core memory (made up of the same memory planes as 
used in MOBIDIC), assembles it into message blocks, 
converts it into FIELDATA form, applies the same er- 
ror and supervisory control as the CV-690, and emits 
the data in FIELDATA form at rates up to 2400 bits/ 
second (3000 words per minute). The receive side per- 
forms the inverse functions. 

It is expected that as more and more of the voice and 
other analog communications systems convert to pulse 
code modulation and other digital forms that additional 
types of FIELDATA embolic equipment will be re- 
quired to perform the error control, supervisory func- 
tions, and buffering/synchronization necessary to tie 
input-output transducers to their digital bit streams. 

The FIELDATA family is an attempt to create an 
integrated family of data transmission equipments to 
meet Army needs. Though lacking many of the features 
and equipments of an ideal family of data transmission 
equipments, it will make available in experimental 
quantities by the end of 1960 the first integrated family 
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of equipments for experimental establishment of a truly 
integrated communications/data processing system. It 
will be the first system in which data processing and 
communications equipment both utilize the same input- 
output and storage devices, the same voltages, im- 
pedance levels, codes, and other common interconnec- 
tion characteristics, and in which the equipments are so 
designed that in many cases the only way to determine 
whether a device is used for communications or data 
processing is to look at its specific application in the 
system. 

Appendix I 

FI ELD AT A Alphanumeric Code 

A key step in achieving the required compatibility 
between the various elements of an automatic data sys- 
tem is the adoption of a common “language” for the 
storage and transmission of data throughout the sys- 
tem. The basic 6-bit alphanumeric code for use in this 
family shall consist of 2 indicator bits and 4 detail bits. 
The pattern of character assignment for the code is as 
follows with the 2 indicator bits determining the choice 
of column and the 4 detail bits determining choice of 
row in Table II. 

Appendix II 

FI ELD AT A Supervisory Code 

The FI ELD AT A supervisory code is used for message 
headings, dialing, multiplex identification, supervisory 
control, and other activities associated with secondary 
data. This code is similar to the FIELDATA alpha- 
numeric code used for primary data. It also consists of 2 
indicator bits and 4 detail bits. The pattern of control 
assignment is as follows with the 2 indicator bits deter- 
mining the choice of column and the 4 detail bits de- 
termining the choice of row in Table III. When it is 
not necessary to provide alphabetic supervisory in- 
formation only the latter two columns are used. In this 
case when the basic 8-bit FIELDATA form is used, 
an OR of the first indicator bit and the tag bit will 
provide clocking for the 96 legitimate characters of the 
8-bit form. 


TABLE II 



00 

(Upper and 
Lower Case) 

01 

(Upper and 
Lower 
Case) 

10 

Upper 

Case 

11 

Lower 

Case 

0000 

Master Space 

$ 

K 

) 

0 

0001 

Upper Case 

\ 

L 


1 

0010 

Lower Case 

y. 

M 

+ 

2 

0011 

Tab. 

o 

N 

< 

3 

0100 

Car. Ret. 

< 

O 

= 

4 

0101 

Space 

A 

P 

> 

5 

0110 

A 


Q 

— 

6 

0111 

B 


R 

$ 

7 

1000 

C 


S 

* 

8 

1001 

D 


T 

( 

9 

1010 

E 


U 

ft 

/ 

1011 

F 


V 


* 

1100 

G 


w 

? 

/ 

1101 

H 


X 

t 

1110 

I 


Y 

y 

Special □ 

mi 

J 


Z 

Stop ®. 

Back Space 


TABLE III 



00 

01 

10 

11 

0000 

BLANK/IDLE 

k 

Dial 0 

Ready to Transmit 

0001 

Control Upper 
Case 

1 

Dial 1 

Ready to Receive j 

0010 

Control Lower 
Case 

m 

Dial 2 

Not Ready to Re- 
ceive 

0011 

Control Tab. 

n 

Dial 3 

End of Blockette 

0100 

Control Carriage 
Ret. 

0 

Dial 4 

End of Block 

0101 

Control Space 

P 

Dial 5 

End of File 

0110 

a 

q 

Dial 6 

End of Control 
Block 

0111 

b 

r 

Dial 7 

Acknowledge Re- 
ceipt 

1000 

c 

s 

Dial 8 

Repeat Block 

1001 

d 

t 

Dial 9 

Spare 

1010 

e 

u 

Start of Control 
Block 

Interpret Sign 

1011 

f 

V 

Start of Block 

Non-Interpret Sign 

1100 

g 

w 

Spare 

Control Word Fol- 
lows 

1101 

h 

X 

Spare 

S.A.C. 

1110 

i 

y 

Spare 

Special Character 

mi 

j 

z 

Spare 

Delete 
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A High-Accuracy, Real-Time Digital Computer 
for Use in Continuous Control Systems* 

W. J. MILAN-KAMSKIf 


I T HAS become evident during the last few years that 
the accuracy requirements of analog computers 
have become too difficult to be easily satisfied. The 
rising pressure to achieve better computational ac- 
curacy has led to significant improvements in the com- 
putational techniques used in analog computers. These 
new improvements have made it possible to achieve a 
high degree of precision so that a 0.1 per cent accuracy 
has gradually become a realistic figure in many analog 
machines. 

However, present-day analog computer technology is 
completely helpless if accuracy requirements approach 
the magnitude of 1 part per million, or 0.0001 per cent. 
The only available computers which can achieve this 
degree of accuracy are obviously digitial computers. 

Many attempts have been made to design digital 
computers so that they might be used as direct replace- 
ments for analog computers. However, a rather unex- 
pected difficulty has arisen. Digital computers, which 
have received a great deal of publicity as being the 
fastest computational tools, are extremely slow when 
compared to analog computers. Since the comparison is 
made between digital and analog computers, the opera- 
tion of the digital computer must be such as to satisfy 
the bandwidth requirements of the analog computer. 
By this equivalence, the bandwidth of a digital com- 
puter can be defined as the bandwidth of an equivalent 
analog computer. 

There are three distinct approaches in solving the 
problem of designing high-accuracy, real-time digital 
computers. All three of these approaches are directed 
toward building high-accuracy digital computers which 
can replace analog computers in applications where ac- 
curacy requirements exceed present capabilities of these 
machines. 

At least one approach has come from engineers whose 
experience and background have been chiefly in the 
field of analog computers. Their basic approach was to 
replace various analog computer elements by equivalent 
digital operational blocks. For example, an integrator 
which consists of a motor with appropriate velocity con- 
trol can be replaced by a reversible counter; a potentio- 
metric multiplier can be replaced by a digital element 
which is called a rate multiplier, and so on. 

* Presently being developed under a subcontract from the Mili- 
tary Products Dept, of Detroit Controls, in Norwood, Mass., for the 
U. S. Navy. 

f Epsco, Inc., Boston, Mass. 


Since the operation of a computer of this type is in- 
cremental, its design approach led to the development 
of a family of computers called incremental digital com- 
puters. 

The second approach was to translate the problem 
into a differential equation and then to solve the differ- 
ential equations by integration. Since the solution of 
differential equations is done using finite increments, 
the family of digital differential analyzers is closely re- 
lated to the family of incremental computers. The out- 
put function of incremental computers and of the digital 
differential analyzers is determined by the increment of 
the input function and by the internal state of the ma- 
chine. These computers, therefore, can be regarded as 
deterministic transducers with infinite memory. 

The third family of real-time digital computers is rep- 
resented by machines which go through a complete com- 
putational cycle every time a new input sample is taken. 
These computers normally adopt computational tech- 
niques which have been developed in programming 
general-purpose digital computers. 

These machines normally have short memories or, in 
many cases, no memory at all. Their output is always 
uniquely determined by the input. 

The latter group of computers is particularly suited to 
applications in which a number of problems must be 
solved simultaneously and concurrently. It is achieved 
usually by interleaving several programs. 

The computational speed of digital computers is 
usually defined as the number of additions or multiplica- 
tions which the computer can perform within a certain 
period of time. This computational speed is extremely 
high when compared to the computational speed of a 
desk calculator. In real-time computation, however, the 
speed of operation is defined as the ability of the com- 
puter to generate output functions, which vary rapidly 
with time. Not only must the output function contain 
large values of higher order derivatives, but also must 
not be delayed by the finite computational time of the 
computer. The transfer function of real-time computers 
is often complicated and usually contains trigonometric 
functions. If a high degree of accuracy is desired, the 
word length required may be as large as 30 binary digits 
or more. 

' It is possible to show that a high-accuracy machine 
has a limited ability to generate output functions which 
contain large values of output function derivatives. The 
computational time increases very rapidly as the word 
length increases. 
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The design of a real-time digital computer is usually 
based on an input-output accuracy specification and on 
the bandwidth requirements. For a digital computer, the 
bandwidth requirement can usually be expressed in 
terms of the amplitudes of output function derivatives. 
Maximum possible values of the derivatives can nor- 
mally be determined by analyzing the geometry and the 
dynamic character of the output function. 

The first trial in the determination of the maximum 
permissible computational time can be accomplished by 
first calculating the greatest possible velocity of the out- 
put function, and then by selecting a computational 
time such that the change of the output function with- 
in the computational time will not be greater than a 
maximum permissible error. 

Errors due to quantization, truncation, round-off, 
function approximation, etc., must be considered sepa- 
rately as additional system errors. In certain problems, 
the computational time calculated from the investiga- 
tion of the maximum output velocity may be extremely 
short. Extremely short computational times can be 
realized with incremental computers. However, internal 
rates of several megacycles are necessary in order to 
construct incremental machines which have equivalent 
bandwidths equal to the bandwidth of analog computers 
and accuracy of 1 part in 10,000. 

In many applications, long computer memory is un- 
desirable as, for example, in all real-time control and 
stabilization computers. Computer response to step in- 
puts in target tracking applications must be excellent. 
Errors must be self-correcting, and the accuracy of the 
computer must be independent of the accuracy of pre- 
vious computations. 

These requirements cannot be readily satisfied by 
purely incremental computers. The selection of a cer- 
tain type of real-time computer should be based on the 
specific requirements of each problem. 

The best results can be achieved if the design of real- 
time computers is specially tailored to each problem. 
The specification for a real-time computer is usually 
determined by accuracy requirements and the character- 
istics of the time function to be controlled. 

There are usually several other factors which are nor- 
mally well specified; for example, the weight and size 
of the computer and the type of hardware to be used. 
These requirements, combined with the environmental 
specification, usually determine the maximum practical 
internal rate of the machine. 

Several design parameters must be considered to de- 
termine the optimum combination of computer ac- 
curacy, internal speed of operation, approximations 
used, sampling rate, and the time of computation. 

The maximum permissible computational time can 
be determined by analyzing the nature of the output 
function. The output function can always be expressed 
in terms of a Taylor series. The actual mathematical 
manipulation can be quite involved. It may also be 
difficult to determine the maximum possible values of 


all the derivatives of the output function. However, if 
the motion of a physical object is considered, it is usually 
sufficient to analyze only the first two or three deriva- 
tives in order to describe adequately the output func- 
tion. Rapid changes in acceleration are very rare, and, 
therefore, higher order terms of the expansion can be 
disregarded. 

The Taylor expansion can be regarded as a polynomial 
in t. It is possible then to substitute a polynomial for 
the output function. The period of time in which the 
polynomial substitution is valid can be determined by 
calculating the difference between the polynomial ap- 
proximation and the output function. The difference 
must be less than the maximum permissible error. The 
higher the order of the polynomial used, the longer the 
period of time over which the substitution is valid. The 
computational time can then be determined by the time 
it takes the output function to diverge by a certain pre- 
determined amount from the polynomial approxima- 
tion. 

The minimum sampling rate and the maximum 
computation time can then be determined for each order 
of the polynomial used as the output approximation. 
Computation times are progressively greater as the or- 
der of the polynomial increases. 

The determination of the coefficients of the poly- 
nomial require the determination of the appropriate de- 
rivatives of the output function. The polynomial co- 
efficients can be calculated on the basis of several sam- 
ples computed at given time intervals. Using Newton’s 
backward interpolation formula, it is possible to deter- 
mine the coefficients of the polynomial by simply cal- 
culating the differences on the basis of several samples 
of the output function. (See Fig. 1.) 


X| 

X2 

*3 



INPUT 

FUNCTIONS 


COMPUTATION 
TIME 


POLYNOMIAL 
APPROXIMATION 
TO THE OUTPUT 
FUNCTION 


Fig. 1 — Computer block diagram. Output function is compensated 
for the computational delay by means of a polynomial substi- 
tution. 


The computation of function differences involves sub- 
traction. Since random errors are not correlated, they 
are not subject to cancellation. In systems in which 
random and bias errors are of the same magnitude, a 
second order polynomial is probably the highest order 
which can be practically used. The computation of the 
terms of the polynomial makes it necessary to memorize 
the results of several computations. In other words, it is 
impossible to construct a computer which uses poly- 
nomial approximation and has no memory. However, 
the memory is relatively short. If a second order poly- 
nomial is used, the computer memory is equal to only 
three computation cycles. 
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The use of a polynomial approximation to the output 
function offers an added advantage which may be im- 
portant in certain applications. The output function 
can be generated in steps which are smaller than the 
maximum permissible errors. The need for this form of 
output may arise if a high performance servo is con- 
trolled by the output of the computer. It can be seen 
from Fig. 2 that the actual output of the computer con- 
sists of a sequence of polynomial segments and that 
there is a discontinuous jump from a polynomial to the 
polynomial whose terms have just been calculated. This 
discontinuity can be made as small as desired. The re- 
duction of the output function steps, however, can be 
achieved only at the expense of the computational time. 
It is possible, then, to trade computation speed for 
accuracy and vice versa. 



Fig. 2— Computer output function. Output function is 
approximated by polynomial substitution. 


In between the computational times, the output func- 
tion is not directly controlled by the input functions. 
However, the nature of the ouput function is such that 
it cannot possibly diverge from the approximated value 
by more than a certain predetermined value. This maxi- 
mum deviation can be calculated by taking the terms 
of the Taylor expansion of the output function which 
do not appear in the polynomial approximation. 

Once the sampling rate and the order of the polyno- 
mial approximation of the output function is deter- 
mined, it is possible to determine the bandwidth of the 
computer. The bandwidth can be calculated by evaluat- 
ing the accuracy of the computer as a function of the 
output function frequencies. 

The frequency of the output function is postulated, 
and the rms value of the errors due to the polynomial 
approximation is calculated. For every frequency, a 
certain value of the rms error can be determined. The 
bandwith of the computer can then be defined as the 
maximum frequency at which the rms error is still with- 
in the permissible limits. 


In all real-time control and stabilization computers, 
it is always necessary to compute some trigonometric 
functions. There are many ingenious schemes of com- 
puting these functions by using the incremental, tech- 
niques. All these techniques, however, suffer from the 
limitation of having infinite or very long memories. In 
the Epsco STARDAC Computer, the trigonometric 
functions are calculated using the Tchebycheff polyno- 
mials. Sine and cosine functions are usually needed 
simultaneously. In the Epsco STARDAC Computer 
they are calculated concurrently by using the powers of 
the argument and the multiplying the result by ap- 
propriate Tchebycheff coefficients. A very high degree 
of accuracy can be realized if the Tchebycheff polyno- 
mial is used within an interval of 0° to 90°. Simple logic 
is used to accommodate arguments outside of this range. 

In this high-accuracy, real-time system, error analysis 
is probably the most important phase of the system 
design. All possible sources of accuracy-limiting factors 
must be carefully analyzed. 

In the applications in which the computational time 
cannot be disregarded, a polynomial substitution for 
the output function is used to offset errors due to the 
computation time. The polynomial substitution can be 
only approximate and consequently an erroi^ is intro- 
duced. Truncation and round-off errors can be deter- 
mined by analyzing the number of significant digits lost 
in the computations. Errors introduced by the sub- 
stitution of Tchebycheff polynomials for the trigono- 
metric functions can be determined. 

Output errors due to the errors present in the input 
functions must be carefully analyzed since these errors 
determine the maximum realizable accuracy of the sys- 
tem. 

The accuracy of the input function has a profound 
effect on the decisions which must be made in the design 
of the computer. If the computer is designed correctly, 
the errors it introduces are normally smaller than the 
output errors caused by the errors in the input func- 
tions. However, the propagation of the input errors 
through the computer must be carefully analyzed since 
some of them can be amplified in the computer more 
than others. The input function errors can be divided 
into two categories, bias and random. 

Bias errors can be defined as those whose magnitude 
is consistent. In other words, the magnitude of an error 
can be predicted with a certain accuracy on the basis 
of the errors present in several previous measurements. 
On the other hand, random errors can be defined as un- 
predictable. The random error in any sample has a 
probability which is independent of the errors present 
in the previous samples. 

The propagation of these errors through the com- 
puter can be traced easily by using appropriate partial 
derivatives. This error analysis is well known to those 
who have designed fire control computers. However, the 
relative magnitude of bias and random errors in real- 
time digital computers is normally different from the 
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Fig. 3 — Computer with covers in place. 


Fig. 6 — -Digital computer module, disassembled. 


Fig. 5 — Digital computer module, assembled. 


Fig. 4 — Computer with covers removed, showing access for servicing. 

relative magnitude of bias and random errors in, for 
example, radar returns. 

In real-time control computers, input random errors 
are usually small and they are very often introduced 
only by the input quantization. The quantization ran- 
dom error has a rectangular probability distribution 


with a maximum possible error equal to one half of the 
least significant digit. 

Various methods can be used in order to minimize the 
effect of random errors on the output function. Input 
random errors are particularly harmful if differences are 
employed in the computation of the polynomial which 
is used as the approximation to the output function. 
For example, if a second order polynomial is used, the 
third difference is calculated and is used to smooth out 
the output function. This compensation is valid only if 
the noise level is such that the third difference of the 
output function is much smaller than the measured 
third difference due to random input errors. This meth- 
od, however, leads to relatively complicated equations. 
It is often possible to obtain a significant improvement 
by simply reducing the quantization errors. This is ob- 
vious since bias errors are not amplified as much in the 
computation of differences as are random errors. 

Accuracy analysis would not be complete without a 
description of the selection of the control equations. In 
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real-time, digital, control computers, accuracy can be 
greatly limited if a large number of mathematical opera- 
tions must be carried out in order to compute the out- 
put function. Long computations are undesirable for 
two reasons. Large numbers of computations are time- 
consuming; and also, in each arithmetic addition as 
much as one half of the least significant digit may be 
lost. It is then necessary to know exactly what is the 
largest possible number of operations which might be 
necessary under the worst possible combination of in- 
put variables. The number of computations, sometimes, 
is very difficult to predict. This is particularly true if 
the computer function involves division and if the de- 
nominator, under certain conditions, approaches zero. 

Unfortunately, this condition arises often in all prob- 
lems in which spherical geometry is involved ; this hap- 
pens, for example, if it is necessary to compute an angle 
whose tangent is determined by a ratio of two expres- 
sions which, in turn, are determined by some other 
trigonometric functions. The angle itself is uniquely de- 
termined for the whole interval from 0° to 360°; how- 
ever, the tangent is discontinuous at 90° and 270°. 

In the STARDAC Computer, this problem was solved 
by the use of an iterative routine, which made it possible 
to compute the argument even if the tangent of the 
angle approached infinity. 

As mentioned before, the STARDAC Computer has a 
built-in sine-cosine function generator. First a number is 
substituted for the value of the argument and the com- 
puter calculates the sine and the cosine. Then the sine 
of the argument is multiplied by the denominator and 
the cosine of the argument is multiplied by the numera- 
tor. In the second step of the computation, a comparison 
is made between the two products. The difference is 
then added directly to the number which was sub- 
stituted for the argument. Then the cycle is repeated. 

Mathematical justification for this operation is almost 
self-evident if the numerator of the fraction is repres- 
ented as sin A and the denominator as cos A. The term 
which is added to the argument can be expressed as 


A = sin OK cos A — cos 6K sin A , 

but 6 was selected at random and was not equal to A. 
So the equation can be rewritten as: 

A = sin ( A + A d)K cos A — cos(^4 + A 6)K sin A 
or 

A = K sin Ad. 

For small Ad, the value of A is equal to KA6. The func- 
tion converges rapidly if the value of the coefficient K 
is close to unity, and in a few iterations the error be- 
comes negligible even for systems which require ex- 
tremely high accuracy. The program is simple. No 
ambiguities arise and the arithmetic operations contain 
only multiplications, additions, and complementing. All 
these operations are particularly easy if performed in 
straight binary code. 

The packaging techniques used in the construction of 
the STARDAC Computer can best be presented by 
referring to Figs. 3-6. Fig. 3 illustrates the computer 
complete with power supplies and input-output equip- 
ment. Fig. 4 shows the computer with covers removed 
and the frames pulled out for servicing. Figs. 5 and 6 
show typical modules used in the computer. 

It is felt at Epsco that a family of real-time computers 
such as described in this paper will find broad applica- 
tion in the field of high-accuracy real-time control sys- 
tems such as stabilization computers, fire control com- 
puters, navigation computers, autopilots, etc. 

A computer whose design is based on the approach 
outlined in this paper can offer an ideal solution to the 
problem of maintaining extremely high internal ac- 
curacy. It is believed that the need for these computers 
will grow together with the need for miniaturized, gen- 
eral-purpose computers. It is felt that this new type of 
computer will soon establish itself as a member of the 
family of computers together with the stored program, 
general-purpose machines and analog computers. 
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The Man-Computer Team in a Space Ecology 

J. STROUDf and J. McLEOD i 


I N OUR coming adventures in the conquest of space 
we have more problems than the well-publicized 
one involved in merely getting off the earth; once 
in space, man has the problem of survival in an ex- 
tremely hostile environment. We are adapted to life on 
this earth, not in space nor on any other planet of which 
we have knowledge. In our first frantic efforts to find 
solutions to the many problems attendant on this, we 
might reflect that once upon a time man was not 
adapted to live on the surface of this earth either. In 
order to survive on land, the many specialized cells 
which cooperate to make up modern man found it 
necessary to bring their environment with them when 
they crept ashore. These cells now live and replicate in 
a miniature sea in which the temperature and even the 
salinity are controlled at about the same values as those 
of the ocean in which they once lived. 

So it seems that the solution to the problem of sur- 
vival in space is obvious: To stay alive man must again 
take a reasonable facsimile of his accustomed environ- 
ment with him. 

We have found the answer! — Or have we? The only 
way we can think of implementing our solution would be 
to arrange an “air lift” to man’s abode in space, and thus 
keep him supplied with his every need. The trouble with 
this is the high cost of the “lift.” Current American 
freight rates from earth to a minimum satellite orbit, 
the staging area for our journey into space, happen to 
be about $30,000 a pound. And present indications are 
that it will be a long time before the rates are reduced 
by a factor of ten — and a lot longer before they can be 
considered “reasonable.” So when we consider that a 
man eats and breathes his own weight in food and oxy- 
gen every two months, and that he will certainly need 
other supplies, we find that our first solution just isn’t 
feasible. 

Stated another way, we are forced to conclude that 
to survive and replicate himself in space, man must be 
equipped before he leaves his staging area to “make do” 
with what he finds in space, namely, abundant radiant 
energy from the sun and some very raw, raw material 
from an occasional asteroid. 

Certainly there is sufficient energy; more than a kilo- 
watt per square meter at the distance of the earth from 
the sun. This is an amount which, if applied to a man’s 
hide, would be sufficient to run him if he were only an 
efficient transducer. It would also be enough if applied 

t Naval Electronics Lab., San Diego, Calif. 

X Convair-Astronautics, San Diego, Calif. 


to the hide of the space ship, or better, an extension of 
it, to propel it about the solar system from the inside of 
the orbit of Mercury to the orbit of Jupiter. 

Of course, as is sometimes said of the impossible, 
solving the problem of converting asteroid chips to sup- 
port life in space may take a bit longer. But with en- 
ergy and sufficient know-how, it can be done. Theoreti- 
cally man can maintain himself in a space-ecology in- 
definitely. In fact, we can say with some certainty that 
someday precisely this process will take place, and that 
the human population of the solar system may very well 
increase a millionfold in a couple of thousand years, 
with 99.9 per cent of this population living in comfort 
and even luxury in little artificial worlds in space. 

Most of the knowledge which allows us to live on this 
earth is currently locked up in our own genes and those 
of the plants and animals which do the real work of our 
planetary ecology, while we exist as the prime parasites. 
To re-educate the necessary genes to live in space would 
be at best a slow evolutionary process, which we have 
neither the knowledge nor the time to accomplish. We 
must find a fast, a revolutionary, way to furnish man 
with the necessary know-how. 

Admittedly, to satisfy our reactionary genes will take 
a lot of know-how. It is beyond the scope of this paper 
— that is to say, it is beyond the combined abilities of 
the two authors — to say just how much a man or a 
colony must know to survive in a space ecology. 

Or, much more to the point, considering that the cost 
of a first-class (and only class) one-way ticket just to the 
staging; area for a space mission is about half a mega- 
buck per person, the question becomes: How few people 
can collectively know enough to survive in space? So 
far as we know no one knows, or is making a serious 
effort to find out. This is either a sad commentary on 
the direction of our space effort or a happy one on our 
security system — if anything about security can be 
happy. However, if it is true that we are not making 
an all-out effort to answer this question, to point out 
that it is already frighteningly late to begin such a 
fundamental task is to belabor the obvious. Suffice it to 
say that we must get on with the task as though our 
very existence depends on it, as well it might. 

But no matter what amount of know-how is found to 
be required to exist in space, it is the burden of this 
paper that computers are the only means by which the 
necessary knowledge can be made available to men out 
there at freight rates we can afford. 

Having stated the problem and suggested a broad 
solution, we leave the details to the fertile imaginations 
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of our bright young colleagues while examining in more 
detail one important aspect of the problem: the man- 
computer relationship which will be required. 

First let us take a brief look at man. His internal 
memory is so unreliable that he must depend on external 
aids, usually written records. These must be 50 to 75 
per cent redundant to be understood, and they cannot 
contain more than 100 bits of information per square 
inch and still be read. Microfilming can increase the 
density of information but it cannot improve man’s 
reading rate beyond the five bit per second human limi- 
tation. 

Moreover, there are very severe limitations as to 
what man can stand physically and emotionally. His 
sensitivity to his environment, already mentioned, may 
be reflected in a quite understandable anxiety concern- 
ing his own well-being under one set of conditions and 
boredom under another. Neither is conducive to reliable 
performance. 

Computers, however, can read non-redundant ma- 
terial with densities as great as a billion bits per square 
inch, and they can read at rates of the order of a million 
bits per second. That they can do literature searches and 
prepare abstracts is hardly news, but it is also true 
that they can perform feats of symbolic logic and deduc- 
tive “reasoning.” Moreover, they don’t get dangerously 
upset under some conditions and negligently bored 
under others. 

For these and other reasons, computers are being 
widely used on earth to augment man’s efforts in sci- 
ence and industry, even when the salary and overhead 
of the individuals being augmented may not exceed 
$30,000 a year. 

Certainly then, all will agree that when man is sent 
into space where the cost of his per diem and transporta- 
tion is nothing short of fabulous, he must have com- 
puter support. 

Note that we have used the words “augment” and 
“support.” Even in our enthusiasm for some of the su- 
perior capabilities of computers we do not suggest that 
man will continue to allow them to explore space with- 
out him, although it is evident that several generations 
of computers will have acquired quite a wide experience 
in space operations before man ventures forth. 

Eventually man must go into space; if for no other 
reason, “because it is there!” However, for a sufficiently 
great number of people to go in one colony to collective- 
ly know enough to survive is not practical with any 
^earth-orbit ferry which we can expect to have in the 
foreseeable future. 


For these reasons the man-computer team proposed 
here for a space ecology does not include a large number 
of human specialists, but rather a few humans of un- 
usually broad background who will only have to be 
able to ask the right questions and do any inductive 
reasoning which might be required. The computers will 
select and supply all detailed information needed, make 
necessary computations, and make all decisions which 
can be reached by symbolic logic or deductive processes. 

We must recognize, however, that to use computers 
in space effectively as a source of most of the know-how 
required for man to thrive will require improving the 
art more than a little, and mostly in the area of estab- 
lishing man-computer rapport. 

At present, many people are inclined to refer all too 
freely to computers as “high-speed” morons. It is just 
possible that this is because the esoteric art called pro- 
gramming is so esoteric that it is practically a cult, 
headed by the Senior Programmer as High Priest. If 
computers do, in fact, sometimes behave like morons, 
perhaps we should ask if anyone has taught them to do 
otherwise. We think nothing of spending twenty-odd 
years programming a two-legged computer. And the 
amount of effort that has gone into developing the ma- 
terial required is measured in man-centuries. 

In contrast, we have had high-speed electronic com- 
puters for less than twenty years — only a few computer- 
generations. We take, at most, a few man-months to 
“educate” them, and then say, in effect, “You do exactly 
what I say in exactly the way you are instructed to or 
I’ll brain-wash you!” If we are to get along well with 
our computers, we are going to have to give a great deal 
more thought and effort to their education than we have 
to date. At present, we are paying some pretty fancy 
prices for a very inferior brand of education for some 
rather bright hardware because we got started on the 
wrong track. Instead of instructing them by rote, we 
should teach our computer to be actively curious, to at- 
tempt to find a few answers of its own. 

Such a radical reorientation of our approach to com- 
puter education is going to be quite painful to some. As 
the old man said, “If you want to train a dog, ya gotta 
be smarter than the dog be!” If the same can be said of 
computers, man) “trainers” will be immediately dis- 
qualified. Do you know what makes you curious? Do 
you know how you distinguish sense from nonsense? 
Neither do we, precisely. But we had better find out — 
and teach our computers — if we are to survive in space 
or anywhere else! 
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The RCA 501 High-Speed Printers — 
The Story of a Product Design 

C. ECKEL f and D. FLECHTNERf 


T HE 501 High-Speed Printer is an output device 
of the RCA 501 Electronic Data Processing Sys- 
tem. This system is fully transistorized. It is ex- 
pandable in both the area of the high-speed memory 
and the input-output devices. In the specific case of 
the High-Speed Printer, this expandability takes the 
form of optional use of the printer mechanism either as 
an on-line device or an off-line device. 

Initially, the 501 product plan was to design and 
produce a minimum cost EDP system. Therefore, the 
first printer specifications called for on-line operation 
with the printer being driven directly by the computer. 
This accomplished two things. It held the cost of printer 
electronics to a minimum, but still allowed the system 
to have a high-speed output capability. 

Subsequent product planning developed the need for 
system expandability, that is, a system which could be 
enlarged at the user’s convenience if the work load in- 
creased. To meet this requirement, a program was also 
started to design buffer electronics to allow the printer 
to be run directly from magnetic tape — off-line. 

Fig. 1 shows a scale model of a basic RCA 501 system. 
Information is entered through the paper tape reader 
at 1000 characters per second. Printed output is avail- 
able from either the monitor printer or the on-line 
printer. Fig. 2 shows an expanded system with provision 
for punched card input and output and additional mag- 
netic tape storage. The High-Speed Printer is now an off- 
line device with buffer electronics permitting it to op- 
erate from magnetic tape. 

The specifications set up for this printer were that it 
should be capable of printing at least 600 lines per min- 
ute with 120 columns per line. It should be capable of 
producing at least an original and three carbon copies, 
offset masters and ditto masters, and contain the neces- 
sary logic to be applicable in all types of format print- 
ing. Not an integral part of the design specifications, 
but perhaps most important over- all, were the criteria 
that the printer should be low enough in manufacturing 
cost, but high enough in performance so that these fac- 
tors alone could ward off early obsolescence. A corollary 
to this was the fact that the length of design cycle must 
be held to a minimum and the design cost should be rea- 
sonable. 

The mechanism selected for the printer was of the 
“flying wheel” variety. Basic techniques employed in 
printers of this type are well known, therefore we shall 

f Electronic Data Processing Div., RCA, Camden, N. J. 



Fig. 1 — Basic 501 system. 



Fig. 2 — Expanded 501 system. 


only dwell on the functional aspects that illustrate the 
product development. 

Fig. 3 shows a flow diagram for the on-line printer. 
Information to be printed is stored in the high-speed 
memory of the computer and the format is controlled 
by a program in the computer program control. The 
memory contents are scanned and a line is printed with 
each revolution of the print wheel cylinder. Synchro- 
nization of the memory and the character identity com- 
ing into position to be printed is accomplished by a 
photoelectric code disk assembly, mounted on the same 
shaft as the print cylinder. The coded bits emerging 
from this disk are mechanically phased with respect to 
the character they represent on the print cylinder. This 
allows sufficient time for a particular character to be 
compared for its occurrence in the computer’s memory 
and if it exists, a bit is placed in the shift register cor- 
responding to the proper print column. A clock pulse 
from the computer then causes printing of this char- 
acter identity. The process is continued until the com- 
puter memory has been examined for all 51 possible 
printing characters. The computer next generates a sig- 
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Fig. 3 — On-line flow diagram. 


nal indicating the amount the paper should be moved, 
and upon receipt of a return signal indicating that the 
paper has been moved, the entire process is repeated. 

Fig. 4 illustrates the off-line operation employing 
suitable buffering logic. The buffer unit is designed to 
accept one line of information at a time, from magnetic 
tape, store it temporarily, and then print it out. The 
line is stored in a core memory, the input to which con- 
sists of a coincidence between character identity and 
column location. The memory is clocked out by the 
photoelectric code disk assembly as each character 
identity comes into print position. 

Printing is normally accomplished in an asynchro- 
nous manner. That is, provision is made to determine 
when all character identities to be printed on a line have 
been printed. Upon receipt of this signal, another line of 
information is immediately read in from magnetic tape 
as the paper is shifted. In this manner, basic printing 
speeds may exceed 600 lines per minute reaching as high 
as 900 for numeric printing. The logical circuitry in this 
area also serves as an accuracy check on the number of 
characters printed vs the number of characters which 
should have been printed. 

In order to control the printed format, several fea- 
tures have been incorporated. First, by means of a plug- 
board, incoming information may be tabulated to any 
of 24 predetermined positions; this same feature may 
also be used to delete information which is not wanted. 
It is also possible to effect multiple printing of the same 
data on one line, again by use of the plugboard. 

Fig. 5 shows the printer mechanism, which is used for 
either the on-line or off-line operation. 

Now that the printer has been described, the follow- 
ing discussion will outline some of the factors which in- 
fluenced the product design. 

The need for economical high-speed printers for com- 
puter output has persisted. Both the electronic and 
electromechanical printers were considered at RCA. 
From an economic and state-of-the-art point of view, the 
electromechanical seemed more promising. Rotary 



Fig. 4— Off-line flow diagram. 



Fig. 5 — Picture of printer. 


wheel printers (Fig. 6) looked to us to be the best com- 
promise as far as simplicity of mechanism and high 
printing speed are concerned. The earlier equipment 
was designed using mechanical printers of this type. 
Since we already had experience with this type of print- 
ing mechanism (certain problems were known to be 
problems) the new product development for the RCA 
501 system consisted of refinements and improvements 
in the techniques. We already knew how to make good 
print wheels, and how to be consistent with the solenoid 
fabrication, and were familiar with the many other 
necessary techniques. 

An area that we felt needed some investigation was 
that of high-speed paper shifting. At the time the project 
was initiated, we had a development design of an elec- 
tromechanical detent spring clutch which gave promise 
of very high-speed paper shifting. We found, however, 
that a magnetic clutch, suitable for paper shifting, 
though not quite as fast, was already commercially 
available, and so it was adopted. 
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Fig. 6 — Print wheel area (ribbon removed). 


In order to appreciate the problems involved in fabri- 
cation of the print solenoids let us examine for a moment 
the general concept of print quality. Admittedly, this is 
a subjective type of thing and depends primarily on the 
ability and the resolving power of the human eye. 

We have found, for example, that it is possible to de- 
tect a vertical misalignment of about 0.005 inch between 
adjacent characters in a line of print without much dif- 
ficulty. In the conventional typewriting or typesetting, 
where similar misalignments occur more frequently in a 
horizontal direction, the effect is not generally displeas- 
ing. People are more familiar with this type of printed 
copy, and they usually accept it without notice. How- 
ever, they detect any vertical misalignment quickly and 
question its reason for existing. 

Fig. 7 shows the word “link” in which the i is 0.005 
inch above a line and n is 0.005 inch below the line with 
l and k on a line. Here, also, you can see the irregular 
horizontal spacing in the top word, a design require- 
ment of wheel printers. We readily accept this difference 
in spacing because of the differing widths of letters. 

Obviously, vertical misalignment is a normal conse- 
quence of rotary wheel printers which must be mini- 
mized. The primary way in which this is done is to as- 
sure that all 120 solenoids are as near alike as possible 
both electrically and mechanically. 

From an ease of manufacturing and of maintenance 
standpoint, we would have liked to have placed the 
print solenoids in two banks, one on each side of the 
print wheel center line, that is, 60 per side, so that all the 
solenoids would be similar. However, the problem of get- 
ting enough energy into and out of a solenoid, which is 
only 2/10 of an inch thick, without crosstalk, resulted 
in a design compromise of two banks of solenoids on 
each side. We found that with the two-bank design, we 
could assemble and pot the solenoids in groups of five. 
Fig. 8 is a representation of the solenoid area of the 
printer. 

The potted assembly of five solenoids is machined in 
a single operation so that the tolerances can be held. 


LINK 

LINK 

Fig. 7 — Normal printing and 0.005-inch misalignment 
above and below a line. 



This technique is similar to that used in fabricating dig- 
ital magnetic recording heads. By this means, we are 
able to make a good solenoid economically with the cor- 
rect tolerances built in, thus eliminating later assembly 
adjustment. Fig. 9 shows the solenoid area with a view 
of an individual unit. 

Another area of investigation centered in the code 
disk. This assembly has the function of establishing the 
angular position of character on the print drum and 
translating the character into coded notation. The 
code disk has perforations corresponding to the 7-bit 
code used in the RCA 501 system. Fig. 10 shows the 
code disk area. 

Here we were able to affect manufacturing economies 
by photo etching through a plate to obtain the coding. 

The logic of the on-line and off-line printers is imple- 
mented with circuit boards of standard configuration. 
Most are the same board types used elsewhere in the 
system (Fig. 11). 

The use of standard plug-in packages, plug-ins that 
are also used in the computer and the rest of the units 
of the system, also helps to keep manufacturing and 
service costs down. 

Design for simplification and ease of field mainte- 
nance meant that the logic should be straightforward 
and easy to understand. All necessary adjustments 
should be in convenient locations and the mechanism 
should be designed in modules which are easily replace- 
able, such as the print drum, ribbon drive, and paper 
shift assembly. In the on-line case, a small maintenance 
panel simulates the computer so that the unit can be 
serviced independently without tying up the rest of the 
system. 
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Fig. 9— Solenoid area. 



Fig. 10 — Code disk. 



Fig. 11 — Typical plug-in transistor circuit board. 


Conclusion 

In the computer field, the major problem encoun- 
tered in product design is time. The technology is ad- 
vancing at a rate which constantly makes new products 
obsolete in the design stage. The product design team 
must carefully weigh the technological advances which 
can be incorporated in a design against the need for 
production release so that the device can be made 
ready for sale. To insure that the product design re- 
mains saleable, the following three items are basic. 
First, the design should be functionally good. Second, 
it should be reliable, and third, it should be reasonably 
priced. When these characteristics are achieved in a 
product design, regardless of technological advance, the 
product will not become obsolete — it will remain mar- 
ketable. 


A Digital Computer for Industrial Process 
Analysis and Control 

EDWARD L. BRAUN f 


Introduction 


K 


MONG the more important reasons advanced for 
the relatively unexploited use of digital com- 
puters in industrial process control systems are 


1) a lack of knowledge concerning process dynamics, 

2) inadequate development of computers engineered 
for and suited to process control applications, and 

3) inadequate reliability of current digital computers. 


Our purpose here is to describe a computer which has 
been designed specifically for industrial process control 
applications. It promises to satisfy reliability require- 
ments, and can be of great utility even in the absence 
of complete information on the dynamics of a process. 
This type of machine can be used for either one or both 
of the following major functions: It can be used to ad- 
vantage in the quantitative determination of the effects 
of different controllable parameters on process perform- 
ance and also as a process optimization control com- 
puter. 


f Genesys Corp., Los Angeles, Calif. 
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We shall not consider here the dynamics of a particu- 
lar process nor attempt to present a quantitative picture 
of the benefits to be realized from the use of new instru- 
mentation and digital control computers in various in- 
dustrial processes. These are the subjects of recent and 
continuing studies by a number of organizations. These 
studies indicate that the utilization of machines which 
allow control to be based on process dynamics as well as 
steady-state considerations may offer, in particular 
areas, one or more of the following advantages: 

1) A reduction in capital investment for new process 
plants by the substitution of small responsive 
equipment and control systems for some of the 
mass and storage capacity on which many plants 
presently rely for stability and self-regulation 

2) A reduction in expenditures for raw materials, 
heating, cooling, catalysts, etc., as a result of more 
precise control 

3) Improved productivity 

4) Improved quality control 

5) Realizable, effective control for new processes ne- 
cessitated by technological progress, and which 
must function under conditions beyond the present 
limits of controlled process variables. 

Analysis of a Process 

The effectiveness of computer control of an industrial 
process is dependent to a large degree on the data avail- 
able concerning the effects of controllable parameters on 
pertinent characteristics of the process. However, this 
does not necessarily imply that the exact dynamics of a 
process must be known in precise analytical terms be- 
fore a process can be controlled. The fact is that control 
can be and is effected with only a qualitative knowledge 
of system behavior, coupled with the use of feedback 
methods. 

The objective of a process analysis is the determina- 
tion of the relationships between the major process 
parameters and the location of optimal operating re- 
gions for the process in terms of these variables. Once 
the optimum operating regions have been determined, a 
decision can be made in regard to which variables need 
be controlled and in what manner in order to maintain 
optimization of one or more characteristics of the proc- 
ess in the presence of disturbances that may arise. 

Let us consider now a relatively simple procedure for 
analyzing the effects of various parameters on system 
performance which can be incorporated into the special 
purpose computer to be described. The procedure con- 
sists essentially of specifying initially allowable varia- 
tions in a number of variables and then programming 
these changes in order to obtain data on their effects on 
the various characteristics of the process. The informa- 
tion entered into the computer prior to an investigation 
consists of the process parameters that are to be varied, 
the size of the incremental steps of the variation (vari- 
able over a limited range), and the upper and lower 
limits within which the variations must be kept in order 


to prevent upset of the process. Where two or more 
parameters are to be varied, the program will cause all 
combinations of these parameter values (within the im- 
posed limits) to be impressed upon the system. The 
only other quantity that need be entered is the stabiliza- 
tion time required for the process to stabilize after each 
programmed change. This delay is usually referred to as 
dead time or process lag. It allows for the occurrence of 
two events. First,, it permits a process variable to reach 
the steady-state value called for by a command in the 
program. Also, it provides time for the over-all process 
to adjust to this value. 

The values of the various quantities whose effects on 
the process are to be investigated are determined by 
suitable sensing devices (whose outputs usually are in 
the form of dc voltages) which are coupled to the com- 
puter via an analog-to-digital conversion system. A con- 
venient way of effecting the programmed incremental 
changes in the process variables is simply to generate 
commands which cause the set points of the various 
controllers in the system to be altered. Just prior to the 
initiation of each new “change” command, the com- 
puter causes the current values of all input and output 
variables of interest to be read out, either in printed 
form or graphically. 

A useful procedure that can be incorporated into the 
program is to cause a reversal in the sign of the subse- 
quent incremental commands whenever either the upper 
or lower limits of a particular variable are reached. This 
produces automatic cycling between preset limits of a 
given variable. A convenient method of programming 
changes in a second variable under investigation would 
be to cause it to be changed by a single increment each 
time the first variable reached a limit and reversed. A 
third variable would be advanced by one increment each 
time the second reversed, and this type of procedure 
could be used, for as large a number of variables as de- 
sired. 

Specifically, the analysis procedure outlined takes 
place as follows. First, the variables of interest are pro- 
grammed for relatively large incremental changes within 
a limited range around their nominal set points, the lat- 
ter being determined either theoretically, from simula- 
tion experiments, or operator experience. For purposes 
of visualization, assume that the effects of two input 
variables upon a particular output variable are being 
investigated. For each pair of values of the input vari- 
ables there will be a corresponding value of the output 
variable. In general there will be a set of values of the 
input variables which produce the same value for the 
output and define what may be referred to as process 
output contour lines. Examination of these contour 
lines in a particular area will indicate in what direction 
within the plane to proceed in order to find better val- 
ues for the output variable. Once the direction has been 
determined, the computer investigates the new area in 
like manner. When the new area to be investigated be- 
comes relatively small, i.e., convergence to a solution is 
approached, the size of the programmed increments 
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may be diminished. Once the optimum setting for a pair 
of variables has been determined (for fixed values of 
other process variables) , it will be found generally that 
the effect of a third variable is to cause both a shift and 
change in size of the optimum output variable contour 
line. Fig. 1 illustrates the variation in the contour lines 
of the Vi, V 2 plane as a function of a third variable, V 3 . 
The values associated with each contour line are given 
by ki. 

The same type of procedure may be used not only in 
an experimental effort to gain information about a par- 
ticular process but also in using a computer to control 
the process. In this case, small adjustments are made in 
the controlled variables until a set of values is obtained 
that produces an optimum output. Whenever a devia- 
tion from the optimum occurs, the computer initiates a 
search for a new set of values of the controlled variables 
that will produce an optimum output. Thus, by experi- 
ment and successive approximations, an optimum solu- 
tion can be produced even in the absence of complete 
quantitative knowledge of the process dynamics. Useful 
data are obtained not only on the relationship between 
specified input and output variables but also on the ac- 
curacy of control that would be required for an allowable 
change in a given output variable. 

Computers for Process Control Applications 

For optimal control of an industrial process, it is us- 
ually necessary to maintain close control over a large 
number of process variables in a way which takes into 
consideration not only the effects of the individual in- 
puts on certain specified outputs but also the relative 
effects of these input quantities. The control system 
should have the capacity (in the event that a particular 
input cannot be made optimum) to generate a com- 
pensating change in one or more other variables of the 
system. Also, it is desirable that the computer be capa- 
ble of optimizing different process outputs in accord- 
ance with the current economics influencing the relative 
desirability of producing different output products, i.e ., 
the fluctuations of supply and demand. 

Once the computer has, by the processes indicated or 
similar ones, produced data on how a given process may 
best be controlled, it may subsequently be used to con- 
trol that process. As a process controller, it is desirable 
that it have the capability to 

1) monitor, store, and log process data, 

2) determine the values of the controlled variables 
that will optimize the output, 

3) actuate controllers, and 

4) check the system and itself to detect malfunctions 
in either. 

Before proceeding to the description of a digital com- 
puter useful for analysis and control of a typical in- 
dustrial control process like fractionation or distillation, 
it is desirable to review the general characteristics and 
capabilities of analog computers and the two major 
types of digital computers — namely, the arithmetic or 



Fig. 1 — Planar maps showing the variation of an output variable as 
a function of three input variables. 

integral transfer type of machine and the incremental 
transfer machine. 

The relative merits of analog and digital computers 
for process control applications have already been con- 
sidered in the literature. The conclusions reached from 
these comparisons are that while an analog computer 
may be adequate in certain cases, it does not in general 
have adequate capabilities to suit it for more sophisti- 
cated control systems. It is limited in its ability to per- 
form operations like the multiplication or division of 
variables, the generation of functions of several varia- 
bles, data correlation, extrapolation, etc. It does not 
have the capacity for logical operations, nor does it pro- 
vide adequate data storage facilities. It is not well suited 
for complicated correlation or data processing. Often, 
it may not be adequate even for relatively simple com- 
putations if there are a large number of them, or if non- 
linear functions are involved. In addition, a digital ma- 
chine offers greater flexibility in the sense of relative 
ease of modification of control functions and also in that 
it provides a number of facilities in addition to the 
computations required for control, such as data storage 
(including the storage of calibration data), logging op- 
erations, alarm generation, etc. Finally, the analog com- 
puter is more prone to faulty operation from marginally 
operating components and does not offer the self-check- 
ing feature of the digital machine. 

The relative merits of integral transfer and incre- 
mental transfer machines may be summarized as fol- 
lows. 

The integral transfer machine has excellent data stor- 
age and processing facilities with a large measure of op- 
erational flexibility. The cost of this is the price of a 
large main store and a large number of arithmetic and 
control circuits. The upkeep is also high because the 
complexity of programming and preventive mainte- 
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Fig. 2 — Functional diagram of industrial process analysis and control computer. 


nance procedures demands skilled programming and op- 
erating personnel. However, beyond a certain size, addi- 
tional capabilities and flexibility can be added at little 
cost. Maintenance of this type of machine is facilitated 
by its capacity to provide elaborate checks on itself and 
the control system in which it is incorporated. The 
integral transfer machine is superior to the incremental 
machine (even one with variable selectable slewing 
rates) in slewing time, and therefore better able to im- 
plement decisions. However, it is relatively inefficient in 
computations on continuous variables and capable only 
of a moderate computation frequency. 

Because of its efficiency in computing with continu- 
ous variables, an incremental machine with relatively 
few elements can provide good capacity and a high 
computation frequency. Its relatively small size gives it 
considerably better reliability with respect to failures, 
though its reliability with respect to malfunctions is 
comparable to that of the integral transfer machine. 
Also, its maintenance is often complicated because of 
some of the logical devices used in its design. In its basic 
form it is relatively poor in respect to slewing time. It is 
not well suited for problems of logical analysis and is 
lacking in certain data processing capabilities. 

TiiE Industrial Process Analysis and 
Control Computer 

It is apparent that both integral and incremental 
computation techniques offer advantages. A basic prem- 
ise of the machine described herein is that it is good 
economics to minimize the number of active storage and 
switching elements by the use of incremental computa- 


tion and stored logic wherever possible. This results in a 
speed of operation that is relatively slow but adequate 
for process control applications. A functional diagram 
of a machine of this type now being built is shown in 
Fig. 2. As indicated, all the functions of the computer 
are achieved through extensive use of a magnetic disk 
store in conjunction with a small magnetic core- 
transistor sequential network. The functions for which 
the machine was specifically designed include 

1) storage of data from process instruments, instru- 
ment calibration data, safe limits of variables, 
computation constants, etc., 

2) data processing and computational capabilities 
like integration, function generation, data cor- 
relation, smoothing and prediction, solving of dif- 
ferential equations, etc., and 

3) generation of signals for adjustment of controllers, 
generation of alarms, etc. 

Briefly, these functions are accomplished as follows. 

Storage of the values of all variables for a specified 
period, say an hour, at a sampling rate of one or two 
points per minute per variable is accomplished by use 
of a delay line of several thousand bits capacity. Thus, 
whenever a variable exceeds prescribed limits, the re- 
cent history of the process is available to aid in deter- 
mining the cause. These data can also aid in operator 
supervision of the process, since system trends can be 
checked by read-out of the same variables at different 
times as a plot. 

Analytical or empirical calibration data for selected 
variables are stored on a group of channels referred to as 
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linearizing channels. To alleviate the problem of instru- 
ment drift, measurement by each instrument of a 
known quantity can be transmitted to the computer at 
specified intervals. From these data, new calibration 
constants may be generated and stored as required. 

A relatively simple alarm control facility is obtainable 
by the allocation of certain channels for the storage of 
upper and lower safe operating limits of certain varia- 
bles. These data are continuously compared with the 
data in the long delay line. Exceeding a limit causes an- 
nunciators and other alarm indicators to be actuated. 
Off-limit data can also be logged or plotted whenever 
alarm conditions are indicated. Computational and logi- 
cal facilities can be provided to produce anticipatory 
indications of trouble, e.g., searches can be made for 
dangerous trends and the simultaneous occurrence of 
events that imply trouble. 

The analyzer is used principally to investigate over- 
all system behavior. It can serve as a simulator, com- 
puting the solution to equations describing the behavior 
of the process and its controllers. It can be used to study 
the dynamic effects of variations in the choice of con- 
trollers, and their set points, utilizing sampled data 
from the system itself. 

The function of the correlator is to aid in the deter- 
mination of the effects of different controllable parame- 
ters on the behavior of the system in order to ascertain 
what variables to control and to compute optimum set 
points for their controllers. 

Set point control is accomplished with the aid of 
channels in which the set points and control parameter 
settings believed desirable are stored. These data are 
continuously compared with sampled data from the 
process to determine current settings for the controllers. 
Also, the settings of the controllers are adjusted to 
maintain output optimization in accordance with data 
from the correlator. Any combination of proportional, 
rate, and integral control can be provided. 

A most important requirement of an on-line process 
controller is a long mean time between failures — of the 
order of six months. Satisfaction of this requirement too 
is facilitated by a design that minimizes the number of 
active storage and switching components. Various de- 
vices employed to reduce the component count while at 
the same time maintaining specified capabilities are 

1) simulation of active storage and switching ele- 
ments by passive storage elements, use of the disk 


store not only for the function of data storage, 
but also for arithmetic and logical transformations 
and control functions; 

2) multiplexing of stored information and time shar- 
ing of active components, 

3) the utilization of both incremental and integral 
transfer techniques, 

4) an organizational structure of the stored data 
which minimizes the amount of control circuitry 
required for manipulation of the data, and 

5) specific logical configurations that capitalize on 
the particular requirements of the control appli- 
cations in question. 

Active elements are used principally to direct data 
from the disk to the controllers and to output equip- 
ment, to modify data in any given channel, and to con- 
trol the flow of data between channels. 

A high degree of reliability is also promoted by using 
circuits operable over a wide range of parameters, by 
underrating circuit components, and by eliminating 
components considered inherently unreliable. Fail-safe 
operation is achieved by placing the computer in parallel 
with the process, i.e., so that it controls only the set 
points of the controllers. 

Over-all system performance depends not only on 
techniques for minimizing the probability of occurrence 
of a malfunction but also on rapid detection of a mal- 
function when it occurs. 

To serve this end, the entire disk system is periodi- 
cally and automatically given a test problem, and there 
is also provision for the insertion of special diagnostic 
programs by the system operator to allow any detected 
error to be traced to the process, including controllers, 
or to the computer. 

The type of computer system described can serve in 
a test and evaluation phase as a process analyzer to ' 
determine the feasibility of computer control and as a 
simulator to investigate the effects of different types of 
control schemes. It can also function as a fixed-program 
process-optimization control computer. Though a sin- 
gle machine can be provided with both capabilities, 
the economics of a particular situation may be such as 
to justify the use of separate computers, one a flexible 
analytic computer, and the other a relatively inflexible 
fixed program computer for control optimization. Each 
of these machines would, of course, be less expensive 
than the machine with both capabilities. 
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The Burroughs 220 High-Speed Printer System 

F. W. BAUERf and P. D. KINGf 


Introduction 

I N the past few years, computer manufacturers have 
been developing and delivering data processors with 
faster and faster processing speeds. Less attention, 
however, has been paid to getting printed information 
from the data processor. As a result, most data process- 
ing systems are input/output bound. 

The speed of the printing device has not been the only 
restriction ; expensive and time-consuming central proc- 
essor runs were necessary to perform the editing and 
formatting of the information for the printed page. 

In order to bring about a system balance and relieve 
the central processor of unnecessary data manipulation, 
a high-speed printing system with complete off-line 
editing was needed. 

To define the problem more closely, we studied such 
applications as: the printing of insurance premium 
notices and declaration forms; wholesale drug and 
grocery billing, oil company and public utility billing; 
and the preparation of bank statements, stock trans- 
fers, and dividend checks. To handle these applications, 
a printer system must not only be fast but also — con- 
sidering customer relations — capable of producing a 
neat legible document. 

Specifications 

The high-speed, large-volume, complex-editing ap- 
plication, then, was the general framework which de- 
fined the boundaries of our system planning. 

First of all, the new printer system must be a high- 
speed device, capable of producing the printing require- 
ments of a majority of applications. The necessity for 
system speed established the need for independent 
printer control of format and editing, since high-speed 
output which depended on extensive editing by the data 
processor could not be considered true speed at all. 

Independent operation and speed requirements indi- 
cated that magnetic tape was the logical choice for com- 
munication between the central processor and printer 
system. By using magnetic tape as a buffer, printing as 
far as the central processor is concerned, is at magnetic 
tape speeds; the central processor need never wait for 
the printing device to accomplish its task. 

To provide further versatility it was deemed advis- 
able to allow the printer system to work directly with 
the central processor for those applications requiring on- 
line operation. 

As a final requirement we decided we would not want 
to prepare special print tapes for off-line operation or 
require central processor time for editing when on-line. 

f ElectroData Div., Burroughs Corp., Pasadena, Calif. 


In either case our obvious aim was to reduce the amount 
of data manipulated within the central processor. Print - 
ing from master tapes or records was a must. 

This, in brief, is the application framework within 
which we established the performance specifications of 
our new printer system. The term “printer system ” is 
significant. In this case, terminology was dictated by a 
desire to describe accurately the operation of the units, 
independent of direct processor control. 

General Description 

The Burroughs 220 High-Speed Printer System (Fig. 
1), is a transistorized, buffered, on-line/off-line subsys- 
tem with versatile editing capabilities controlled by a 
plugboard. The system, which includes a Printer Con- 
trol Unit and a Printing Unit, is designed to operate on- 
line with the Burroughs 220 Data Processor, or off-line 
with one or two standard Burroughs 220 Magnetic Tape 
Storage Units. 

Control Unit 

The Printer Control Unit (Fig. 2), houses an 1100- 
digit, random-access core storage used as a buffer, 
which accommodates up to 100 computer words (10 
digits plus sign). The Control Unit also contains the 
system’s control circuitry, a 120-character print register, 
a 120-position bit register, the transistor power supply, 
and the plugboard. 

Printing Unit 

The Printing Unit contains a drum-type, high-speed 
printer (Fig. 3), having 120 print positions, with 10 
characters to the inch, and a total of 51 characters per 
print position. Of these, 15 are special characters, in- 
cluding CR, OD, DR, + and — symbols. 

The Printing Unit (Fig. 4), also contains paper mo- 
tion controls and the power required to drive the print- 
ing mechanism. 

Vertical line spacing is fixed at six lines to the inch. 
Printing can be positioned any place on a 16-inch form. 
The printer can accommodate a maximum form width 
of 20 inches for centered printing. The maximum form 
length is 22 inches. The printer can print an original and 
five carbon copies. 

Complete control of paper skipping (Fig. 5) is ac- 
complished by means of a seven-channel punched paper 
tape loop, | inch wide and photoelectrically sensed. 
The paper tape loop provides six predetermined paper 
skip positions and a carriage exit position, which is used 
primarily for page overflow or for logical decisions. 
Thus, jumping to and printing header information on 
the next form is easily programmed. 
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Fig. 5 — Printing unit, side view. 


Fig. 2 — Control unit. 

51 CHARACTERS 


120 POSITIONS 

Fig. 3 — Print drum. 

Under plugboard control, page-skipping and single- or 
double-line spacing before and after print allows easy 
accommodation of preprinted forms. Paper moves at 
the rate of 25 inches per second, with 9 msec start-stop 
time for a single-line space time of 16 msec. 

The printer may be operated at print drum speeds of 
750, 900, 1500, and 1800 rpm. The effective printing 
rate is dependent upon the information to be printed. 
For alphanumeric information, the effective printing 


rates are 624, 720, 1068, and 1225 lines per minute. If 
only numeric information is to be printed the effective 
printing rates automatically become 750, 900, 1500, and 
1225 lines per minute. 

These higher numeric print speeds are accomplished 
by detecting in the control unit during the loading of 
the print register that only numeric information is to 
be printed. (See Fig. 6.) When this condition exists the 
control unit terminates the print cycle at the end of the 
numeric section of the print drum rather than requiring 
a full drum revolution. The time required to traverse 
the remainder of the print drum, the alphabetic sec- 
tion, is used by the printer system in spacing paper and 
reloading the print register. 

It can be seen that at the top drum speed of 1800 rpm 
the printing rate is only 1225 lines per minute for nu- 
meric information, the same as for alphanumeric infor- 
mation. This is because the time required to load the 
print register or space the paper is greater than the time 
required to traverse the alpha section when the print 
drum is rotating at 1800 rpm. 

Fig. 7 is a front view of the print unit showing hori- 
zontal paper positioning controls, vertical paper posi- 
tioning controls, paper tension controls, the ribbon 
mechanism, and the upper and lower set of form trac- 
tors. Lateral positioning of printing and form size ad- 
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Fig. 6 — Character arrangement on print drum. 
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Fig. 7 — Printing unit, front view. 

justment are accomplished with the control in the up- 
per right of the picture. 

Controls are available for moving all tractors to the 
right or left in synchronism for positioning of printing 
on the page. In addition, controls are available for mov- 
ing either the right or the left set of tractors individually 
for accommodating form width. Paper tension between 
the upper and lower sets of tractors is controlled by the 
upper left control knob. By using two sets of tractors, 
the upper and the lower, and positive mechanical de- 
tenting, paper creep is eliminated. Fine vertical adjust- 
ment within a line is accomplished with the lower left 
control knob. 

Operation 

With some of the details out of the way, we can now 
push a few buttons and see how the Burroughs 220 
High-Speed Printer System works. The internal opera- 
tion of the system is divided into three basic cycles 
(Fig. 8): 

1) Load cycle — During the load cycle, the core buf- 
fer is loaded from magnetic tape or from the Data Proc- 
essor. 



Fig. 8 — System flow diagram. 


2) Scan cycle — The scan cycle is basically the trans- 
ferring of selected information from the buffer to the 
print register. During the scan cycle, all editing, for- 
matting and selection is accomplished. The scan cycle 
is normally completed during paper spacing time. 

3) Print cycle — During the print cycle the line of in- 
formation is transferred from the print register to the 
paper. Since the core buffer is not used during the print 
cycle, the load and print cycles can occur simulta- 
neously. 

Load 

When used off-line, the load cycle is initiated from 
the plugboard. Information is read from one of two 
magnetic tape units. The block length on tape is vari- 
able from 10 to 100 words and can be variable within a 
run. The number of blocks read per load cycle is selected 
from the plugboard, the only restriction being that no 
more than 100 words can be loaded into the buffer at 
any one time. If the capacity of the buffer is exceeded, a 
buffer overflow alarm is available on the plugboard for 
automatic corrective action if needed. Selection of the 
storage area to which the information will be sent is 
plugboard controlled. As a result, it is possible to have 
in the buffer at one time any combination of information 
from 2 tape units and the data processor. 

When used on-line, loading of the buffer is controlled 
by two commands from the data processor. The first 
command is used to determine if the printer system is 
ready to accept information. The second command loads 
the buffer with a record from 1 to 100 words in length. 
Once loaded, the data processor is freed. 

Scan 

The scan cycle is so named because information to be 
printed is not transmitted by plugboard wires but 
rather by internal channels. The scan cycle includes the 
transfer of selected buffer information through a trans- 
lator to the print register for subsequent printing. All 
editing of information is accomplished during this phase 
of the printer operation. The plugboard wiring is used 
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for decision making, selection of starting positions of 
fields, special character insertion, editing, and format- 
ting functions. 

During scan, the starting position of a field is selected ; 
the field then reads out sequentially to the print register, 
with no restriction on field length, and continues until 
ordered to a new starting location. After a change of ad- 
dress has been ordered the scan continues sequentially 
from the new address. The addressing of the buffer is 
controlled by a counter called the character address 
counter, which changes the location when set to a new 
value. Fig. 9 illustrates the operation during the trans- 
fer of one digit from the buffer to the print register. The 
character address counter selects the digit to be trans- 
ferred, out of 1100 possible settings. For each one of 
these settings there is an exit hub on the board. These 
exit hubs are the source of control pulses, to cause ad- 
dress selection, formatting and control. Other functions 
which can be initiated by these pulses include zero sup- 
pression and the insertion of blanks, commas, decimal 
points or dollar signs. A feature which helps reduce the 
amount of information to be manipulated is the char- 
acter emission of all 51 characters for printing of fixed 
information — the date, for example. 

When information is read out of the buffer, the digit 
value is available on the plugboard and can be used for 
controlling and testing purposes. Digits so used need 
not be printed; for instance, decisions whether to print 
a record or not can be based on the digital value of a key. 

The scan cycle is automatically terminated when the 
print register has been filled with 120 characters, and a 
print cycle is automatically started. At this time an end 
scan pulse is available for initiating a buffer load if de- 
sired. 

Print 

At the time the print cycle starts, the print register is 
filled with a 120-character line. There is a counter syn- 
chronized with the rotation of the print drum which 
tells at any instant the next character on the drum in 
position to print. By comparing each position of the 
print register with this counter the positions to be 
printed are determined. At the start of the print cycle 
this comparison process begins immediately (Fig. 10). 

To illustrate, if the counter were at a value corre- 
sponding to R all R's in the print register would be 
printed first. When the print drum has rotated through 
one position and the counter advanced to S, all the 5’s 
will be printed and so on. The print cycle will be com- 
pleted in this case when the print drum has rotated back 
to R. The printing actually occurs by timed firing of 
print hammers. The 120-position bit register contains 
the “yes” or “no” of whether the print hammer associ- 
ated with a particular print position will fire at a speci- 
fied character time. Loading the bit register is accom- 
plished by means of the print register comparisons just 
mentioned. 



Fig. 10 — Print cycle. 


If spacing is wired on the plugboard to occur after 
printing, paper spacing will start immediately after the 
print cycle. A start scan pulse is available on the board 
to start a new scan cycle, assuming no load cycle is tak- 
ing place. If a load cycle is not yet completed, the start 
scan impulse will be held up until it is. However, a load 
cycle will normally be completed during a print cycle, 
except at the highest drum speeds. 

Special Features 

This, in essence, is the Burroughs 220 High-Speed 
Printer System. We will look next at some of the special 
features of this system. Fig. 11 lists the functions avail- 
able to complete the editing ability and enable printing 
from master tapes; these are initiated by character ad- 
dress pulses. 

The special features of the Burroughs 220 High-Speed 
Printer System are described in the following para- 
graphs (Fig. 12). 

Wiring by Exception 

The control panel need only be wired when informa- 
tion is to be printed out in a different sequence than that 
contained in the buffer. In this case, the character ad- 
dress pulse of the last digit of a field to be transferred is 
wired to address the starting position of the next field. 
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1. Zero suppress 

2. Check protecting asterisk insertion 

3. Comma and decimal point insertion 

4. Insertion of blanks 

5. Delete 

6. Special sign translation to +, — , CR, OD, DR 


Fig. 11 — Special functions. 


1 . Wiring by exception 

2. Field selection 

3. Exception or selective printing 

4. Field interrogation 

5. Multiple line printing 

6. Retention of fixed information 

7. Relative addressing 


Fig. 12 — Special features. 

Thus, field selection is accomplished. Since it is not 
necessary to provide plugboard wires for the transmis- - 
sion of all positions of a field the actual number of plug- 
board wires is reduced to a minimum. 

Exception or Selective Printing 

Transferring the digit value of the contents of a buf- 
fer location to the plugboard during transfer of informa- 
tion to the print register allows recognition of keys with- 
in the information. Thus, logical electronic elements 
cause or prevent printing. For example, when printing 
paychecks, the plugboard can be wired to print checks 
for only those employees with earnings, ignoring all 
employees with no earnings. 

Field Interrogation 

Logical decisions can also be made by comparing 
buffer information to preset information in 10 rotary 
switches. For example, in a statement preparation ap- 
plication, the date would be set in the switches. Only 
those accounts scheduled for that date would have a 
statement printed. All other accounts would be skipped. 

Multiple Line Printing per Buffer Load 

Because the buffer is actually a storage device, any 
number of lines can be printed from one buffer load. 
Repeat printing of any information is possible. Com- 
plete documents can be prepared with one reading of 
the tape record. 

Retention of Fixed Information 

The ability to select the starting address during load 
enables the retention of information in the buffer, for 
example, page heading information. This feature is par- 
ticularly useful when multipage documents are to be 
printed. 

Relative Addressing 

(See Fig. 13.) This is a form of indexing register which 
enables grouped records to be printed in the same for- 
mat with wiring for only the first of the records. The 



character address pulses are always governed by the 
character address counter. However, the information is 
available from the buffer position determined by the 
sum of the setting of the character address counter and 
the relative address register. Thus, by changing the 
value of the relative address register, different informa- 
tion can be read out of the buffer with the same value 
of the character address counter. This feature allows 
side-by-side printing without duplication of wiring. 

It is the relative address register which determines 
the starting address during a load cycle. The character 
address counter is always set to zero at the start of a 
load cycle. As during the scan cycle, the sum of the char- 
acter address counter and relative address register ac- 
cesses the buffer. 

Checking 

Complete checking of all information transfer and 
programmer control of error conditions are provided. 
Fig. 14 shows the checking points and methods within 
the system. 

During load, each digit is checked for parity and in- 
valid combinations. In addition, a check is made on the 
number of digits in the tape record. If any error is de- 
tected, the tape is automatically reread and an error sig- 
nal emitted from the board. If an error persists after two 
retries, the system automatically stops, unless pro- 
grammed to ignore this stop or to take other remedial 
action. 

Parity is checked during the transfer from the buffer 
to the print register through the translator. Parity and 
invalid characters are checked again in the translator, 
and errors are indicated on the plugboard. 

During printing, another parity check is performed 
with errors again indicated on the plugboard. In addi- 
tion to parity checking during printing, a synchroniza- 
tion check is made which insures that the print position 
was fired at the correct time for a given character. This 
print check feature (Fig. 15) works as follows. 

A reluctance emitter in the printer counts a row count- 
er. This counter determines the character to be read 
out. A home pulse from a second reluctance emitter is 
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Fig. 14 — System checking. 


compared with the row counter when the latter’s value 
is at zero. If the two are out of synchronization, the 
system automatically halts. 

This is the only checking feature which cannot be ig- 
nored ; the print check alarm is not available on the plug- 
board. 

All error checking alarms that are available on the 
plugboard can be used to cause retries or brute-force 
operation which can be flagged on the printed page. 
Thus, in every case but one, the programmer, not the 
machine, decides whether an operation is to be halted or 
not. Oftentimes, the programmer will want to wire auto- 
matic restart procedures on the plugboard. 



Fig. 15 — Synchronization check. 


Because of the versatility of the Burroughs 220 High- 
Speed Printer System, all operations must be pro- 
grammed or wired. The operator effectively provides 
his own logical operations by wiring. 

Conclusion 

To summarize, the Burroughs 220 High-Speed 
Printer System offers a maximum of editing versatility 
with minimum plugboard wiring. More important, it 
allows swift, simple but complete rearrangement of buf- 
fer information — and eliminates the necessity for com- 
plex and time-consuming data shifting within the com- 
puter or the preparation of special print tapes. 

Because of this flexibility and power, the printing 
problems of a wide range of applications can be solved 
with ease. 


The ACRE Computer — A Digital Computer 
for a Missile Checkout System 

RICHARD I. TANAKAf 


Introduction 

T HE effectiveness of a missile system is directly de- 
pendent upon the proper assembly and subse- 
quent reliability of its various subsystems. A sup- 
porting checkout system which enables rapid, consistent, 
and thorough testing of subsystems is an essential item 
in insuring the over-all operational success of a complex 
missile. 

This paper describes a digital computer which is used 
as the central controller in an automatic checkout sys- 
tem. The system itself is called ACRE, for Automatic 
Checkout and Readiness Equipment; the computer is 

t Lockheed Missiles and Space Div., Palo Alto, Calif. 


referred to as the ACRE computer. The ACRE com- 
puter is, essentially, a general-purpose, stored -program 
digital computer; particular capabilities, however, have 
been emphasized to enable efficient operation of the 
checkout processes. 

The computer and associated system are required to 
perform functions which can conveniently be grouped 
as follows: 

1) Monitor key quantities which indicate the exist- 
ence of conditions hazardous to the missile or to 
associated personnel. 

2) Perform detailed checkout on a newly manufac- 
tured missile system to inspect for proper opera- 
tion or to diagnose possible causes of malfunction. 
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3) Perform tests on a standby missile system at pe- 
riodic intervals, to verify tactical readiness. Again, 
if a malfunction is detected, a diagnosis is required. 

4) Execute, rapidly, the test sequence required prior 
to firing. For possibly marginal systems, a quanti- 
tative measure of operational success probability 
is desirable. 

It is possible, of course, to meet the above require- 
ments by utilizing manual methods or special-purpose 
test devices. There are obvious disadvantages, however, 
to both of these procedures. The described digital com- 
puter enables efficient operation in all of the modes de- 
scribed above and affords the following salient advan- 
tages. 

Adaptability 

The system can easily adapt to changes in the mis- 
sile, in the test procedures, or in the criteria used for 
evaluating results. If required, the system can be used 
to test differing kinds of missiles. 

Reproducibility of Test 

Once a test sequence has been established, there are 
no unwanted variations in the test procedure thereafter. 
The operational advantages of a special-purpose check- 
out device are obtained. 

Versatility 

In the event that particular test results indicate a 
system malfunction, the computer program enables a 
wide variety of alternate procedures to be followed. 
Some factors for determining alternatives are: 

1) the importance of the malfunctioning subsystem; 

2) the extent of the malfunction; 

3) the particular test mode in progress ( i.e ., a system 
in the process of standby check offers alternatives 
not available for a system which is being checked 
in preparation for firing) . 

Possible reactions to an indicated system malfunction 
might include a retest against less stringent test limits, 
halt of the test sequence, or automatic switching to 
standby system components. 

Automatic Sequencing 

By using a stored program which includes the pro- 
cedures to be followed if a malfunction is detected, the 
test sequences become completely automatic in their 
execution. This feature is particularly valuable during 
countdown; the test times are accurately known, and 
human operator errors (which could easily occur during 
the stress of tactical countdown) are eliminated. 

Self-Diagnosis Possibilities 

As a corollary to the above, the computer also can be 
programmed to execute routines intended to monitor 
the operation of the checkout system itself. The rou- 


tines can be commanded to occur at periodic intervals, 
or may be the first step following an indicated system 
malfunction (to verify that the missile system, not the 
checkout system, is at fault). 

General System Operation 

Fig. 1 illustrates the functional requirements for a 
general checkout system. The following operations are 
required. 

Select and Control Test Stimulus 

If a signal source is required as input to a subsystem 
or system, the appropriate signal generators are se- 
lected and adjusted to provide the proper levels. 

Select Input to Missile 

The test stimulus is then channeled to . the proper 
input terminal of the system under test. 

Obtain and Convert Results 

The resulting output from the system (the test value) 
is converted, if necessary, from analog to digital form. 

Perform Comparison 

The test value is compared against previously estab- 
lished test limits. 

Evaluation of Test 

If the test value falls within the test limits, a “Go” re- 
sult is obtained ; if the test value falls outside the allow- 
able limits, a “No-Go” result is obtained. These two 
possibilities determine the choice between two paths: a 
successful test or “Go” result causes the normal test se- 
quence to continue; a “No-Go” result causes execution 
of evaluation modes as previously described. 

Documentation 

For later diagnostic purposes, it is extremely im- 
portant that a detailed and accurate record be kept. 
Furthermore, the documentation process should be as 
completely automatic as possible. 

Display 

For supervisory purposes, a display panel of suitable 
information (test number, results, etc.) is required. 

General Characteristics of the 
ACRE Computer 

The ACRE computer is designed to meet the func- 
tional requirements outlined above; further, since the 
checkout system must operate under tactical as well as 
laboratory conditions, requirements of reliability, main- 
tainability and environmental suitability are also in- 
volved. In a very general sense, the machine satisfies the 
latter requirements with conceptually straightforward 
logical organization and carefully designed, conserva- 
tively operated circuits. 
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and overrides 

Fig. 1 — Generalized checkout system. 

Physical Characteristics 

The specifications on physical volume enable an over- 
all assembly oriented toward accessibility and ease of 
maintenance. The computer circuits, on plug-in etched 
card assemblies 6X6 inches in size, are arranged in hori- 
zontal chassis in groups of 25. The chassis are wired into 
assemblies termed “pages”; two pages, forming a 
“book,” comprise the computer. 

The front of a page is shown in Fig. 2 ; the intercon- 
nection wiring is on the back of each page. 

Fig. 3 illustrates the flip-flop card, which contains 
three transistorized flip-flop circuits. Diode logic gates 
are fabricated on similar cards. 

The computer requires approximately 40 flip-flops, 
and operates at 100-kc clock rate. Physical dimensions 
are 20X20X56 inches high. 

To aid in the maintenance of the computer, a special 
maintenance panel is attached, which includes: 1) a 
tester for the various circuit cards; 2) a memory simula- 
tor which allows static test of the computer with the 
magnetic memory disconnected; 3) controls which al- 
low single clock operation and manual setting of flip- 
flops; 4) indicators attached to the individual flip-flops; 
and 5) voltage level switches for marginal checking. 

Depending upon external system requirements ( i.e ., 
tie-in with missile subsystems, various converters, etc.), 
as many as 30 additional flip-flops may be required. 
This quantity varies directly with the desired system 
performance requirements. 

Memory 

The ACRE computer utilizes a rotating magnetic 
memory with an addressable capacity of 3904 words, 
24 bits in length. These are organized into 60 channels, 
each with 64 words, and four faster access channels of 
16 words each. 

The memory also contains a nonaddressable 64-word 
channel which is used for automatic storage and output 
tape buffering for the later described documentation 
capability. Further, the clock pulses, sector reference 
information, and the various one-word registers are 
supplied by the magnetic memory. 



Fig. 2— ACRE book: front view. 



Fig. 3 — ACRE flip-flop board. 


The six 24-bit registers on the memory are: 1) the A 
register, the main arithmetic register in the machine; 
2) the D register, an auxiliary arithmetic register; 3) the 
B register (Index register), used both for address modifi- 
cation and for tallying purposes; 4) the Order Counter, 
which specifies the address location of the next order to 
be executed; 5) the Address register, which stores the ad- 
dress portion of an order obtained from memory; and 6) 
the Documentation or X register, which receives in- 
formation either from selected arithmetic registers or 
from an external keyboard, for eventual storage on 
magnetic tape. 
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Logical Organization 

An Order Counter is used to specify the normal 
program sequence; the order structure is single address. 
An order word contains a 5-bit command code, a 12-bit 
address (designated as m), and single bits each for 
parity, ^-reference, documentation, and spacer. 

Numerical information is represented as sign and 
magnitude (fractional binary), with single bits for parity 
and spacer. 

The machine has a total of 25 commands; these can 
be categorized loosely as 5 arithmetic commands, 9 
internal transfer commands, 6 control commands (jump 
to location m for various conditions), and 5 special 
commands which relate directly to the checkout re- 
quirements. (The latter 5 are included in the later de- 
scription of special commands.) 

Special Computer Features 

Miscellaneous features, which define some of the 
capabilities of the ACRE computer, are described below. 

Parity 

To detect malfunctions in the transfer and retention 
of information, word contents undergo automatic 
parity check during transfer. If a word is modified by 
arithmetic processes, a new parity bit is generated and 
inserted in the word. Parity error causes the computer 
to halt; the operator is notified by a suitable alarm 
indicator. 

Switching Matrix 

The ACRE system incorporates a switching matrix 
which is directly controllable by the computer. At 
present, a relay switching matrix is used, since the speed 
of the relay network is sufficient to meet existing sys- 
tem requirements. (Also, relays enable convenient 
handling of a wide range and class of test variables with- 
out elaborate preprocessing.) If desired, a solid-state 
switching network can easily be substituted. 

B Register 

As previously mentioned, an Index register is pro- 
vided for automatic address modification and for tally 
uses. A one-bit in the ^-reference position of an order 
word causes the contents of B to be added to the address 
portion of the order prior to execution. 

For tally purposes, commands which increase or de- 
crease the contents of B are provided. The decrease B 
command allows branching on the sign of B. The logic 
allows this command to be used either for branching 
each time until the final traversal of a loop, or for the 
opposite case of not branching until the final traversal. 

Because of the similarity of many of the test se- 
quences, the B register enables a significant saving in 
storage required. Since, for tactical operation, the pro- 
gram and all associated parameters are stored perma- 
nently on the memory, the inclusion of a B register (at a 


cost of two flip-flops, associated logic, one read and one 
write amplifier) contributes much more than program- 
ming convenience alone. 

Documentation 

The documentation feature enables automatic re- 
cording of all information pertinent to the checkout 
processes. Normally, the information to be stored on 
the output tape is specified by the program and hence 
is documented automatically; however, to enable the 
operator to insert additional information, input from a 
keyboard can be documented during intervals when the 
computer is idling. 

During computation, a one-bit in the documentation 
code position of an order word causes information ap- 
propriate to the command to be documented (two 
examples: the sum for an addition command; the word 
transferred, for any of the various transfer commands). 

This information progresses from the one-word X 
register into the 64-word special storage channel. When 
the channel is filled, the entire channel contents are 
transferred automatically to an output magnetic tape. 

The output tape recorder uses magnetic tape 1 inch 
wide, on 8-inch reels, each with a total capacity of 5 
million words of storage (each word 24 bits long) . Cross- 
channel and longitudinal parity are automatically 
generated and recorded. 

Display 

The progress of each test sequence is indicated by a 
panel of display lights. The display lights are addressa- 
ble through the switching matrix; the operator receives 
a direct indication of the sequence in progress, and of 
test results obtained. 

Input-Output 
Program Test Loader 

The magnetic memory is filled by the Program Test 
Loader, ademountable input device using seven-channel, 
punched paper tape. The Loader uses channel parity 
and cross-channel parity to check information pickup; 
as a further safeguard, the memory contents are auto- 
matically read back and verified. 

For laboratory use, the Loader can be used as a 
normal input reader. 

For tactical operation (where the test sequences pre- 
sumably have been generated, tested, and then stored 
in their entirety in the memory), the Loader is discon- 
nected. Disconnecting the Loader insures that no in- 
advertent modification of memory contents can occur. 
The stored routines are so designed that the operator 
is required only to monitor the results, or, in a few 
instances, to initiate sequences by push button control. 

External Controls 

The operator is provided with a minimal selection of 
controls. These include Start, Stop, the documentation 



Tanaka: The ACRE Computer — A Digital Computer for a Missile Checkout System 


221 


keyboard, and Test Selectors. The latter are individual 
buttons which set the Order Counter to preselected con- 
figurations, enabling convenient manual selection of 
particular tests. 

For maintenance purposes, controls affecting physical 
conditions, e.g., single clock, marginal checking voltages, 
drum simulator levels, flip-flop set and reset switches, 
are provided. Further, a Function Switch enables selec- 
tion among three modes of operation: Normal Opera- 
tion, Single-Order Execute, and Breakpoint Operation 
(the last-named causes halt after execution of each 
command accompanied by a breakpoint code-bit). 

Output Display 

A numerical display device can selectively display 
the contents of the Order Counter, the A register, or 
the Documentation Register. The Order Counter dis- 
play consists of four octal digits; the other two registers 
appear as sign and four decimal digits. 

As previously mentioned, test indicators, primarily 
controlled by the switch matrix, are provided. These 
indicate, for example, status of various missiles, test in 
progress, subsystem under test, test results, etc. 

Discussion of Commands 

The commands available in the ACRE computer have 
been mentioned in the description of logical organiza- 
tion. Seven of the 25 commands directly related to the 
checkout process are discussed in detail below. 

Compare ( cpr ) 

The contents of the A register are compared with the 
contents of D\ if A is greater than D, the computer will 
transfer control to the order found in memory position 
m. Both registers remain unchanged. The comparison 
process is basic to the operation of the checkout system, 
and is required frequently. Hence, although the com- 
parison, and transfer of control, can be performed by a 
subroutine, it is convenient to have the process avail- 
able as a command. 

Switch ( swc ) 

Twenty bits in position m are transferred into the 
Switching Address Register. At the end of the transfer 
process, a signal to initiate switching is generated. The 
switching matrix proceeds to establish the connections 
specified by the 20 bits; the computer is free to proceed 
with the program. An interlock is provided, so that if 
switching is still in progress when a new switch opera- 
tion is commanded, the computer will idle until the 
previous operation has been completed. 

Select (sit) 

The four least significant digits in the address code m 
are used to set the four flip-flops of the Select Register. 
The register contents then determine which item of 
external equipment is to be affected by subsequent 
commands. 


Conditional Adjust Test Equipment ( caj ) 

An item of test equipment, designated by the Select 
Register, receives the 12 bits of the address m. The bits 
are made available only after any previously com- 
manded switching operation has been completed. 

The selected equipment could be a signal generator 
whose output is adjusted by the 12 bits; it also could 
be an analog-digital converter, whose scale adjustments 
or whose turn-on signal is derived from the 12 bits. 
The switch interlock insures that switch connections 
pertinent to the operation of the selected equipment 
have been made before the equipment is activated. 

Unconditional Adjust Test Equipment ( uaj ) 

This command is similar to “caj” above, except that 
no interlocks with the switch operation are provided. 
This command is used when the desired connections 
are known to be made. The unconditional aspect insures 
that a simultaneous switching operation related to 
another test cannot inhibit an operation initiated by 
this command. 

Bring Test Equipment Output ( bte ) 

The output from an item of test equipment, desig- 
nated by the Select Register, is read into D and also 
into position m. This command is used, for example, to 
obtain the output of an analog-digital converter for 
subsequent comparison against programmed limits. 

Halt (hit) 

The Halt command causes the computer to idle. 
Simultaneously with the Halt, the Select Register re- 
ceives the four least digits of the address code m. The 
machine will start automatically when one of the 16 
signal sources, selected by the contents of the Select 
Register, turns on. This allows the checkout system to 
wait for the establishment of various conditions in the 
missile before proceeding with a test. 

Conclusions 

The ACRE system has demonstrated the feasibility 
of assigning to a digital device of the stored program 
class all of the central control requirements for a missile 
checkout system. The advantages inherent to a stored 
program have contributed significantly to the deriva- 
tion and application of test sequences at all levels of 
missile operation, from manufacturing to field readiness 
to prelaunch countdown. The tests, in turn, are of 
paramount importance in insuring the highest possible 
degree of successful missile operation. 
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IBM 7070 Data Processing System 

J. SVIGALSf 


I N 1953, International Business Machines Corpora- 
tion (IBM) introduced its first production model 
of a large-scale computer — the IBM 701 Data 
Processing Machine. Since that time, the company has 
engineered and delivered a number of major data proc- 
essing systems, each designed to meet certain business 
and scientific requirements. 

These systems have included the 305 Ramac, 650 
Basic, 650 Tape, 650 Ramac, 702, 704, 705, 705 III, and 
709. In addition to these computer systems, the com- 
pletely transistorized 608 Calculator has been in produc- 
tive operation since 1957. 

Each of these systems has extended and refined the 
growing body of practical knowledge of both the versa- 
tility and limitations of data processing equipment. The 
result has been an increase in the total productive time 
of the machine installations and more efficient use of 
these machine hours. 

More important, the application of these systems to 
actual working problems has produced thousands of 
specially trained personnel in the business world as well 
as within IBM itself. 

The new IBM 7070 Data Processing System is a 
product of the ideas and expressed needs of this experi- 
enced group. It is designed to serve as a partner of the 
man-machine-methods team of modern business. 

The purpose of this paper is to describe the general 
organization of the IBM 7070. This description will in- 
clude a discussion of the special features of the system 
and its physical and electrical characteristics. 

General Organization 

The IBM 7070 Data Processing System combines ad- 
vanced engineering design, based on high-speed, solid- 
state components, with an equally modern machine or- 
ganization. The system is capable of efficient solution of 
both commercial and scientific applications. The IBM 
7070 Data Processing System spans a wide range of 
capacities and features. These include punch card, mag- 
netic tape, and magnetic tape/Ramac configurations. 

The punch card IBM 7070 system consists of the 
central computer, one to three punch card readers, and 
one to three card punches or printers. The card readers 
operate at a speed of 500 cards a minute; the card 
punches operate at a speed of 250 cards a minute; and 
the on-line printers operate at a speed of 150 lines per 
minute. In the card system configuration, all card input 
and output devices can operate completely simultane- 
ously with each other and with internal computation. 

To achieve a tape system configuration of the IBM 
7070 Data Processing System, two tape channels are 

t Regional System Dept., IBM Corp., Los Angeles, Calif. 


added to the computer configuration. Each tape channel 
is capable of operation simultaneously with the second 
tape channel and internal computer operation. Each 
tape channel may have from one to six magnetic tape 
units connected. These may be any combination of regu- 
lar-speed (41,667 characters per second) or high-speed 
(62,500 characters per second) magnetic tape units. This 
allows for a total of 12 magnetic tape units, any two of 
which will operate simultaneously with computation. 

A tape Ramac IBM 7070 configuration is obtained by 
the addition of one to four Ramac file units to the sys- 
tem. The Ramac units are interconnected so that they 
may be operated through either tape channel. This al- 
lows completely simultaneous operation of two Ramac 
read and/or write operations in a manner equivalent to 
that of the magnetic tape units. Each Ramac file pro- 
vides three access arms. Experience with the IBM 650 
Ramac systems indicates that access to these files can be 
achieved effectively in zero time. This is accomplished 
by seeking ahead for the next record while a previous 
record is processed. 



Fig. 1 — The IBM 7070 Data Processing System. 

A typical IBM 7070 system is shown in Fig. 1. In- 
cluded are the following units: 

Console: This is a separate unit which includes the 
console typewriter and a small operator’s panel. The 
console unit is designed to simplify and expedite the 
operator’s task and to insure maximum productive ma- 
chine time. The typewriter is the principal operator’s 
tool. It replaces many of the indicator lights and control 
switches of previous data-processing machines. Operator 
error is minimized by the computer’s ability to audit 
operator commands through a stored program and by a 
printed record of all data entered and emitted through 
the console typewriter. 

Magnetic Tape Units: The magnetic tape units are 
seen to the right, rear of the picture. Two types of units 
are available. The 729 II reads or writes tapes at a rate 
of 41,667 characters per second. The Model IV reads or 
writes tapes at a rate of 62,500 characters per second. 

Card Reader: Immediately in front of and to the left 
of the magnetic tape units is a card reader. This unit 
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operates at a rate of 500 cards per minute with format 
control by means of a control panel mounted on the 
reader. Data from a full 80-column punched card may 
be transferred into the computer simultaneously with 
internal computer operations. The card reader is 
equipped with a front attended tray feeding hopper and 
stacker. As many as three card readers can be utilized 
for card input. Selected cards may be offset in the 
stacker. 

Card Punch: To the right of the card reader is the 
card punch. This unit operates at a punching speed of 
250 cards per minute with format control by control 
panel wiring. Front attended hopper and stacker are 
used. Selected cards may be offset in the stacker. As 
many as three card punches can be utilized for card out- 
put. 

Printer: The printer is located to the right front of 
the picture. This unit operates at a speed of 150 lines 
per minute with format control provided by the control 
panel. The printed line output consists of a span of 120 
characters spaced ten to the inch. 

Central Processing Unit: This unit is seen to the left 
rear. It contains most of the system electronics and con- 
sists of the following elements: 

1) Arithmetic registers and core memory, 

2) Indexing hardware, 

3) Space for optional floating decimal arithmetic 
(with automatic double precision operation), 

4) Magnetic core memory of five or ten thousand 
words, 

5) Two data channels, code translators, data registers 
and controls for magnetic tape and Ramac storage 
units, 

6) Buffers and controls for card input, card output, 
and the printers. 
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Fig. 2 — Two-out-of-five code. 



special typewriter equipped with a solenoid-driven key- 
board and transmitting controls. A 16-channel punched 
mylar tape provides format control. Up to 10 manual 
inquiry stations can be attached to the system, through 
two buffers. The stations can be connected to the system 
by cable up to 2500 feet from the central processing unit. 

Machine Characteristics 
Bit-Code Structure 


Disk Storage Units: To the left of the picture are the 
disk storage units. These units consist of a stack of large 
disks which magnetically hold up to 12,000,000 digits 
each. Information is read from and written onto these 
disks by an access mechanism containing a magnetic re- 
cording head. This mechanism moves rapidly to any 
disk in the file. Three access mechanisms for each stor- 
age unit are provided to minimize access time by over- 
lapping the "seek” operation. Up to four disk storage 
units can be utilized by the 7070 system, providing a 
total storage capacity of up to 48,000,000 digits of rapid, 
random access memory. These units are attached to the 
system by the same two channels that connect the mag- 
netic tape units to the system. Each of these channels is 
linked to a disk storage unit by program control, and 
allows any combination of simultaneous read/write/ 
compute. 

Manual Inquiry Station: The manual inquiry stations 
are shown immediately in front of the Ramac disk stor- 
age units. These units permit fast interrogation of data 
stored in the computer core storage, in the disk storage 
unit, or on magnetic tape. The station consists of a 


Information is represented by a two-out-of-five code. 
The total number of possible combinations is 10, one 
for each numerical digit. As shown in Fig. 2 the bit 
positions are designated 0, 1, 2, 3, and 6. Each of the 
digits 1 to 9 is made up of two bits whose sum equals the 
number in question. Zero is designated by the one- two 
combination. Alphabetic information is represented by 
a dual-digit code. 

Data Storage 

A word in the machine code is composed of 55 bits. 
Fig. 3 shows a word which consists of 10 digits plus sign, 
or five alphabetic characters with an alphabetic sign. In 
each case, automatic recognition of the sign position 
indicates to the computer whether the information is 
alphanumeric or numeric and all operations are per- 
formed automatically according to the sign. 

Validity Checking 

All information transfers to and from storage within 
the 7070 computer are tested to insure that each digit 
has two "one” bits, no more and no less, for each five- 
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bit position. Because the checking is for an exact num- 
ber of “one” bits for every digit, this type of coding 
structure affords a complete and consistent self-check- 
ing of data flow. 

Data Transmission 

An important feature of the 7070 is parallel transmis- 
sion of data to and from core storage. An entire word, 
including sign, is moved all at once. A channel for 
parallel bit transmission consists of 55 lines, one for each 
bit in each of the 10 digits and the sign position. This en- 
ables a word in core storage to be moved in 6 jusec to or 
from core storage. Data are transmitted between the 
core registers within the computer at a 4-jusec rate. All 
information is transmitted in parallel with the exception 
of data transmitted to and from the magnetic tape units. 
Parallel transmission is represented by the parallel 
lines in Fig. 4. The accumulators all have serial paths 
connecting them to the core adder. There also is a serial 
data path to and from the input/output synchronizers. 

Arithmetic Operations 

The arithmetic unit of the IBM 7070 contains three 
accumulators, each with a capacity of 10 digits and sign. 
In addition to the accumulators, the auxiliary register 
and the arithmetic register are each capable of contain- 
ing ten digits and sign. The interconnection of these 
units is shown in Fig. 4. 

Addition 

An amount to be added to an accumulator is brought 
to the arithmetic register. The number in the accumu- 
lator is sent to the auxiliary register. The content of 
these two registers is added, one digit at a time, and as 
the result is developed by the adder, it is brought to the 
arithmetic register. At the conclusion of the operation, 
the result is sent from the arithmetic register to the des- 
ignated accumulator. 

Arithmetic Timing 

The duration of an arithmetic operation is determined 
by the size, in digits, of the factors. In an add instruc- 
tion, for example, it is determined by the size of the field 
or by the significant digits in the accumulator, which- 
ever is greater. Arithmetic operations take only the 
amount of time needed to perform the actual computa- 
tion; there is no time wasted in accumulating a full ten- 
digit number for each arithmetic operation. The addi- 
tion times for fixed point operation are shown in Table 
I. These figures include access time for the instruction 
and an operand. 


TABLE I 


Length of Operand 

Execution Time 

1, 2, or 3 digits 

48 Msec 

4, 5, or 6 digits 

60 Msec 

7, 8, 9, or 10 digits 

72 Msec 



Instruction Format 

Each instruction in the program consists of 10 digits 
and sign. The digit positions are numbered 0 through 9 
(left to right) as shown in Fig. 5. For most operations 
these digits are utilized as follows: 


Sign and positions 0-1 : operation code 

Positions 2-3 : indexing word 

Positions 4-5: control digits 

Positions 6-9: address 


Operation Code: The sign and first two digit positions 
provide for a maximum of 200 different operation codes 
of which 120 are currently used. In addition, some of the 
operation codes have multiple functions. These are 
accomplished through different values which are placed 
in digit positions four and five to further define the op- 
eration. For example, in a card operation, position four 
specifies a particular input or output unit. Position five 
defines the specific card operation such as read, punch, 
or print. 
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Fig. 6 — -Magnetic tape characteristics. 


Index Word: Positions 2 and 3 of the instruction 
specify the indexing word to be used. There are 99 index 
words in the magnetic core storage, each of which con- 
tains a 10-digit number with sign. They are stored in 
memory locations 0001-0099. The indexing word por- 
tion of a program step determines which of these 99 
index registers will be applied; 00 means no indexing. 

Control: In all of the arithmetic instructions, any por- 
tion of a word can be processed as easily as the entire 
word. Positions 4 and 5 of an instruction determine the 
part of the word that will be used. The digit in position 
4 denotes the left end of the field. The digit in position 5 
specifies the right end of the field. This is called “field 
definition.” The field definition feature means that sev- 
eral fields with like signs can be stored in a single word 
with no inconvenience to the programmer in processing 
an individual field. It should be noted that in all arith- 
metic operations using field definition, information ex- 
tracted from a word is shifted automatically to the right 
when placed into the designated accumulator. In stor- 
age operations, a number of digits equal to that specified 
by the field definition digits is extracted automatically 
from the right-hand portion of the accumulator, shifted 
left to the designated location of the word, and then in- 
serted in the word without disturbing the remaining 
digits. Field definition operations occur without any 
additional execution time. 

Address: The address portion of an instruction, posi- 
tions 6-9, may refer to the storage location of the data 
or the location of the next instruction. 

Indexing: As stated previously, memory locations 
0001-0099 also may be used as index registers. When a 
word is used as an index register, the digits have the fol- 
lowing value: 

Sign position: This specifies whether the actual value 
of the indexing portion shall be added to or sub- 
tracted from an address during an index operation. 

Digits 0-1: Not used. 

Digits 2, 3, 4, 5 : This is the indexing portion of the 
index word. Together with the sign of the index 


word, this portion is added to the data address of 
the instruction. All instructions are indexable. 

Digits 6, 7 , 8, 9: This is the fixed portion of the index 
word and may be used by the programmer to store 
constants, decrements, increments, or limits. 

Magnetic Tape Characteristics : The IBM 7070 Data- 
Processing System provides two types of magnetic tape 
units. These are the IBM 727 Model II or the 729 Model 
IV. Data from magnetic tape to the system are read into 
magnetic-core storage, and a record is written on tape 
from the core unit. Any group of locations in core stor- 
age can be used for these purposes. The tape control unit 
provides two channels, each connecting as many as six 
tape units with the main system. The total of 12 mag- 
netic tape units can be used in any combination. The 
comparative characteristics of the two tape unit models 
are shown in Fig. 6. The primary difference between 
them is the greater density of bit storage and passing 
speed of Model IV. Both units provide high-speed re- 
wind at a maximum speed of 500 inches per second. 

Successive records on tape are separated by a blank 
space called the interrecord gap. Every tape-read in- 
struction causes an entire record to be read: the read- 
ing is stopped by the interrecord gap. The size of a tape 
record has no limitation except the capacity of core 
storage used in transferring the information to or from 
the tape. When reading from the tape, even this restric- 
tion is not rigid if data from only a portion of the tape 
record are needed. The program defines the number of 
words and locations that can be read into core storage. 
Any information from the tape in excess of that amount 
is not accepted by the machine. 

The coding structure used on magnetic tape is a seven- 
bit alphanumerical code. This code is identical to that 
used by other IBM data-processing tape systems. Every 
character must have an even number of bits, and all are 
tested for this in every tape read or write operation. In 
addition, there is a horizontal check of each record. As 
the record is written on tape, a horizontal check char- 
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acter is created to make the number of bits even in each 
of the seven channels. The check character itself must 
have an even number of “one” bits to make the over-all 
total of bits even. 

The IBM 729 tape units have an additional checking 
feature on tape-writing operations. The reading head 
reads the tape just after it is written. This function is 
illustrated in Fig. 7. This check is performed automati- 
cally and provides immediate verification of the actual 
tape record. 

New Machine Characteristics 
Record Definition Words 

An important new concept introduced by the IBM 
7070 Data-Processing System is that of record definition 
words. These words define the first and last addresses of 
a block of data stored in core storage. One of these words 
is placed in storage for each block. The words may be 
used singularly or in tables of record definition words, 
as shown in Fig. 8. In each case, the last word in the 
table is signified by a minus sign. Record definition 
words are used for all data movement of more than one 
word at a time. This includes disk storage read and 
write, core-to-core block transmissions, movement of 
input data from the input buffers and output data to the 
output buffers, inquiry and reply information, and 
others. 

Scatter Read /Gather Write 

A powerful programming tool in 7070 operations is 
scatter read/gather write. This is illustrated in Fig. 9. 
A single record read from tape can be divided into as 
many parts as desired by the programmer. These parts 
are distributed to the different blocks of core storage as 
the tape is read. The figure shows how an inventory 
record is separated by field and category during the 
tape reading procedure. This operation is controlled by 
the table of record definition words shown in Fig. 8. 
This feature applies to writing on tape as well as reading 
from it. It enables the program to gather data from vari- 
ous blocks and automatically assemble them into one 
tape record. 

Record Mark Words 

Words containing the record mark character give an 
additional flexibility to the scatter read/gather write 
feature. Detection of the record mark word automati- 
cally denotes the end of a block in a scatter read/gather 
write operation. In scatter read, a tape record will not 
continue to fill the storage block if a record mark word 
is read into the storage area. Instead, it goes to the next 
block and begins to fill it with the characters immedi- 
ately following the record mark word. In tape writing, a 
record mark word, in core storage will signal the system 
to stop moving data from that block and start sending 
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Fig. 8 — Record definition words. 


from the next block. Tape read or write instruction de- 
termines whether the record mark word will be opera- 
tive. If not, a record mark word is treated as a normal 
alphabetic word. 

Zero Elimination 

When a numeric word in storage is written on tape, as 
many as five high-order zeros can be automatically 
eliminated. The sign of a word is combined with its unit 
position digit. When the tape is read, the presence of a 
sign combined with a digit indicates the unit position of 
a word. The core storage word will be filled automati- 
cally with zeros in the high-order positions to replace 
those eliminated during the write operation. The func- 
tions of record mark word and zero elimination are com- 
pletely automatic and require no additional execution 
time other than the time normally required to read and 
write magnetic tape at full tape rate. These functions 
result in a variable word and variable record size on 
magnetic tape. 
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Fig. 9 — Scatter read/gather write example. 


Channel Control 

The magnetic tape system of the 7070 has two chan- 
nels, each connecting as many as six tape units with 
magnetic core storage. The channels also connect to as 
many as four disk storage units. These disk units can be 
programmed to use either channel, whereas a tape unit 
can use only the one connecting it to the central proc- 
essing area. 

The primary purpose of the additional channel is to 
allow the IBM 7070 to perform two operations simul- 
taneously — reading two tapes, writing two tapes, or 
reading one and writing one. Each control channel per- 
forms one of the two tape read/write operations; while 
these simultaneous operations are taking place the pro- 
gram can continue. The function of the tape channel 
is shown diagrammatically in Fig. 10. 

Information is moved between core storage and disk 
or tape storage by the transmission registers. The pur- 
pose of the transmission registers is to change the type 
of transmission from serial to parallel, or vice versa, and 
to synchronize the speed of the tape or disk with the 
speed of magnetic core storage. Data must move to and 
from tape and disk storage serially, but they are always 
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sent to and from core storage in parallel. In reading tape 
or disk storage record, the characters are read serially 
into transmission register A until the ten-digit positions 
and sign position are filled. At this point they are sent 
in parallel to transmission register B. Register A then 
starts to fill up serially again with the next characters on 
the tape or disk record. The contents of register B are 
sent in parallel to the magnetic core storage location 
specified. In tape or disk storage write operations, data 
go in parallel to transmission register B, then in parallel 
to transmission register A, and serially to a tape or disk 
storage unit. 

The storage locations are controlled by the record 
definition register, one of which is used for each tape 
channel. The starting (or working) and stop addresses 
in the registers are obtained from a table of record defi- 
nition words stored in the memory. As a word is read 
from tape, the content of the working address is in- 
creased by one and compared with the stop address. 
After each word is read from tape, these addresses are 
compared. If they are unequal, the working address is 
increased by one and the next word is moved from tape 
to the new storage location specified. 

This process continues until the working and stop ad- 
dresses become equal, or until a record mark word is 
read. At that point, the last word is transferred, and the 
sign position in the record definition register is tested. If 
it is plus, the record definition word address register is 
increased by one, and a new record definition word is 
brought in from storage to designate a new block of 
words. When the sign of the record definition register is 
minus, this means the last record definition word in the 
operation has been reached and the operation is com- 
pleted. 

The operations described here are accomplished auto- 
matically within the record read or write time. The 
combination of zero elimination, record definition words, 
and record mark words provide a completely automatic 
method of obtaining variable word size, record size, and 
block size on magnetic tape. 

Segment Forward or Backward Space per Count 

These are new commands designed to allow auto- 
matic spacing over a series of tape records. The spacing 
operation can be performed with the tape moving in a 
forward or reverse direction. The tape is moved at full 
speed until one or several segment marks are sensed. 
The number of segments skipped is defined by a record 
definition word. The segment marks are recorded on the 
tape under program control and can be used for any 
type of demarcation required, such as a fixed number of 
tape records or a change in the file sequence control. 

Table Look-Up Operations 

There are three types provided in the IBM 7070. 
They are 1) equal or high, 2) equal, and 3) lowest. In 
equal or high, a sequential table is searched for the first 
entry that is equal to, or higher than, the search criteria. 


Table look-up equal searches a random table for an 
entry equal to the search criteria. Table look-up lowest 
searches a random table for the entry with the lowest 
search criteria. 

Floating Decimal Arithmetic 

An optional computing feature of the IBM 7070 is the 
use of floating decimal operations. This includes auto- 
matic floating point double precision operations which 
provide eight high-order and eight low-order Mantissa 
digits, each with its correct, two-digit characteristic. 

Automatic Priority Processing 

This new programming feature makes it possible to 
process more than one program during the same period. 
This procedure eliminates time lost in waiting for input/ 
output operation to be completed, since one program or 
another is constantly functioning. One of the programs, 
called the main routine, has a comparatively large num- 
ber of program steps. The others, called the priority 
routines, have relatively few instructions, but involve 
almost continuous use of card reader, card punch, 
printer, tape unit, or disk file. 

The main routine functions normally, while the tape, 
disk storage, or input/output unit in the priority routine 
is operating. These operations may include reading a 
card, punching a card, reading tape, writing tape, seek- 
ing a disk file record, reading a record file, or writing a 
file record. When any one of the preceding operations is 
completed, the main routine is signalled automatically. 
The priority routine carries out its program, stops its 
input/output or storage unit, and releases priority. The 
main routine then takes up exactly where it left off. It is 
possible to have more than one tape, disk storage, or in- 
put/output unit operating on a priority basis during 
the main routine. However, only the main routine can 
be signalled for priority; it is not possible to do this to a 
priority routine. If a second priority is ready while the 
first one is in progress, it will wait until the first one is 
completed. The main routine is resumed only when there 
are no priority routines waiting. 

Priority is determined by the setting of the stacking 
latches, shown diagrammatically in Fig. 11. There are 
stacking latches for the card input/output units, one 
for each tape unit, and one for each of the three read/ 
write heads in each disk file. Each program step in the 
main routine tests any or all of these latches to see if the 
priority routine is ready. These tests are made without 
any delay in the program steps of the main routine. 

There are four types of priority: card, tape, disk file, 
and inquiry. Card priority is caused by the completion 
of a card read, card punch, or print operation. A tape 
priority routine is initiated by the completion of a tape 
read or write operation, and disk file priority is started 
at the conclusion of a disk file read, write, or seek. A 
manual inquiry automatically initiates an inquiry prior- 
ity routine. In all cases, the program may test, set, or 
reset any latch. This allows complete control of priority 
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Fig. 11 — Stacking-latch scanning schematic. 


processing by the program although the entire scanning 
process is continuous and automatic. In any tape opera- 
tion requiring special attention, an automatic tape 
priority is initiated. This includes, for example, opera- 
tions in which an end-of-tape is recognized. 

Physical and Electrical Characteristics of the IBM 7070 
Data-Pro cessing System Printed Circuit Cards 

The system electronics is housed in a group of sliding 
gate cabinets, one of which is shown in Fig. 12. These 
cabinets are designed to accommodate printed circuit 
cards of the type shown in Fig. 13. These printed cards, 
which mount up to six transistors, are designed for auto- 
matic fabrication. Approximately 14,000 cards are em- 
ployed in the basic 7070 system shown in Fig. 1. A total 
of 30,000 alloy junction germanium transistors and 
22,000 point contact germanium diodes are used in this 
system. 

Chassis intercard signal wiring is provided by jumper 
wires which are connected to the card socket by wire- 
wrap connections. Wiring is accomplished automatically 
by wire-wrap machinery. This is shown in Fig. 14. Dis- 
tribution of power supply voltages to the transistor 
cards is accomplished by printed circuit strips. These 
features enable a major portion of the electronic system 
to be fabricated by automatic equipment. 

Physical Planning Requirements 

The use of a fully transistorized system has resulted 
in a reduction of the physical facilities needed to sup- 


port the IBM 7070 Data-Processing System. Table II 
compares the savings for the- IBM 7070 system with 
those of a comparable IBM 705 II. 


TABLE II 


Floor space reduced up to 

50 per cent 

Power reduced up to 

58 per cent 

Air conditioning reduced up to 

58 per cent 

Weight reduced up to 

50 per cent 


These reductions will result in substantial savings 
during the physical installation phase of each IBM 7070 
system. In addition, existing facilities will accommodate 
more powerful IBM 7070 systems within physical fa- 
cilities now using nontransistorized computers. 

Library and Programs for the IBM 7070 Data-Processing 
System 

An extensive library of programs is provided with the 
IBM 7070 system. These reduce the amount of time 
and effort normally required in a computer installation. 
The major automatic and library programs available 
are as follows: ■ 

Autocoder: This is a commercial assembler program 
based on comparable programs used with the 705 
systems. 

FORTRAN : This program is a complete compiler for 
scientific problems. The FORTRAN language is identi- 
cal to that currently used on the IBM 650, 704, 705, 
and 709 systems. 
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Fig. 12 — Sliding gate housing showing printed circuit cards. 



Sort and Merge: This program provides from two- to 
five-tape merging according to equipment available for 
the program. It defines a wide range of record length 
and control data. 

Report Generator: The report generator describes and 
assembles a program to produce any desired report in 
about 30 to 45 minutes over-all time, including sorting 
and merging in the preparation of the report. 

Utility Programs: These are a comprehensive, inte- 
grated set of utility programs to be used primarily with 
the Autocoder. They include the use of modern testing 
techniques, loads, memory prints, tape prints, etc. 

Ramac Programs: These utility programs are for 
the Ramac file, loading, unloading, and searching op- 
erations. They are similar to programs now provided for 



Fig. 14 — IBM 7070 sliding gate housing showing 
wire -warp intercard wiring. 


the IBM 650. In addition, a set of programs facilitates 
use of the chaining system, including an evaluation, a 
loading, additions, and deletions program, and optimum 
sequencing program. 

General: Several programs assist in input/output 
areas such as tape labeling, end of file, error routines, 
and restart procedures. An input/output routine auto- 
matically schedules simultaneous reading, writing, and 
processing functions for IBM 7070 card, tape systems, 
and Ramac systems. This new routine enables the pro- 
grammer to think of his job as a serial operation and 
automatically provides an efficient overlap of all func- 
tions. An analysis of total programming requirements of 
previous systems indicates that input/output has 
averaged approximately 40 per cent of the total coding 
job. With this new program, the IBM 7070 users can 
reduce the time substantially since they are not con- 
cerned with the implications of input/output schedul- 
ing. 

Simulation: An IBM 704 program simulates and tests 
7070 programs. A 7070 program simulates the 650 
card, tape, and Ramac machine with a similar 7070 con- 
figuration. This program will run at a speed at least 
equivalent to the 650 execution of a program. 

Summary 

The evaluation of a data processing system depends 
upon more than its inherent machine characteristics. 
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It includes the procedures, methods, and programs 
provided with the system and the manufacturers’ sup- 
port needed during the pre-installation, installation, 
and post-installation phases. All of these necessary and 
important features are provided with the IBM 7070 
Data Processing System. 

Specifically, they are: 

Balance System: The high speed of computing and 
internal flow of data are balanced by high-speed tape 
units, rapid access storage in the disk files, and high- 
speed card readers and punches. 

Maximum Utilization: Automatic priority processing 
allows efficient time-sharing and multiprogramming 
abilities for input, output, tape, disk file, and inquiry 
operations. 

Building Blocks: A variety of units in varying ca- 
pacities permit custom-made systems with the ability to 
grow as the user’s needs are increased. 

Application Range: The 7070 can handle a wide range 
of applications, including batch processing, in-line proc- 
essing, and computing. It covers the area of medium- 
to large-scale systems. 

Transistors: Solid-state components offer such ad- 
vantages as high reliability and reduced requirements 
for floor space, electric power, and air conditioning. 

Access and Use of Storage: Each word in core storage 
can be used for a program step, input or output. Scatter 
read and gather write minimizes the need for additional 


steps which arrange data or assemble them for tape 
operations. 

Simultaneous Operations: Transferring data to and 
from the system can be overlapped with computing 
operations. 

Fully Alphabetic : A complete 80-column card can be 
read in or punched for any combination of numeric and 
alphabetic data. 

Programming Logic: Field definition, 99 index words, 
single address instruction, and many other factors con- 
tribute to direct and simple programming logic. 

Variable-Length Records: Full flexibility in handling 
grouped records of variable length on tape is provided 
automatically by the scatter read and gather write fea- 
ture and automatic zero elimination. 

Reliability : Complete checking both of input, output, 
internal operations, and tape and disk storage insure the 
ultimate in performance. 

Programming Systems: Assembly programs and a 
number of other library routines assist in the planning 
and programming. 

Programming Testing: Programs can be tested prior 
to delivery permitting full operation immediately after 
installation. 

IBM Services: IBM offers training, planning and pro- 
gramming assistance, customer engineering, and other 
services. These are the vital steps necessary to insure 
that the man-machine-methods team is complete and 
will function properly. 


An Organizational Approach to the Development 
of an Integrated Data-Processing Plan 

GEORGE J. FLEMINGf 


T HE dictionary defines organization in four ways. 
One of these definitnios is: “The way a thing’s 
parts are arranged to work together,” and this is 
the one that most nearly describes the subject. 

The term integrated data processing has various 
meanings. Although originally used to describe common- 
language machine procedure it has gradually been ex- 
tended to include all phases of the processing of data 
and is often used to describe the wedding of two or more 
data systems. For the purposes of this paper, the 
broader definition will be used. 

The reasons for which data are processed may be 
categorized as follows: 

t Boeing Airplane Co., Seattle, Wash. 


1) Top-level management reports. 

2) Middle-management reports. 

3) Functional reports. 

4) First-level management and operating reports. 

Although these categories are a bit arbitrary, they 
are a useful classification for establishing the require- 
ments of an integrated system. Each of these categories 
competes with the other for data and each has the fol- 
lowing characteristics : 

1) Requirement for data and supporting records. 

2) Cycles on which they are produced. 

3) The degree of accuracy that needs to be main- 
tained. 

4) Manner or method of presentation. 
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The degree to which each category is compatible to 
mechanization varies as does the method of collecting, 
controlling, transporting, transcribing and processing 
of the data. 

Top-level reports are usually prepared by a skilled 
staff from data already processed and recorded. They 
often reflect the considered judgment of the staff and 
may be accompanied by documents that serve to an- 
alyze and evaluate the results they reflect. The volume 
of data required is relatively small, although data may 
be collected from many sources. These reports are us- 
ually produced on a monthly or longer cycle and the de- 
gree of accuracy required is often exacting. Current 
data-processing techniques are not usually adaptable 
to these reports. 

Middle-management reports are also generally com- 
piled from data that have been summarized and re- 
corded for other purposes. The interests in this category 
are as wide as the data-collection and processing sys- 
tems will permit. Requirements range from the critical 
to the superficial and the pressures that can be brought 
to bear are considerable. The cycles vary from weekly 
to monthly and the volumes of data required can be 
tremendous. The degree of accuracy is high and meth- 
ods of presentation are complex. 

The need to present data that are meaningful in a 
manner that requires minimum analysis is important as 
is careful disciplining of the requirements. Generally 
these requirements are adaptable to mechanization. 

The functional reports include payrolls, inventories, 
production control, and other similar types of data proc- 
essing that service the many functions of the company. 
The processing of large amounts of data collected from 
many sources is a major characteristic. The reporting 
cycle is most often weekly and the work usually involves 
the maintaining and up-dating of records. The degree of 
accuracy required is high as accumulative errors may 
lead to considerable distortion. 

The cooperation of many departments is usually re- 
quired in order to combine families of associated process- 
ing into functional applications. An example of such a 
combination is the labor handling function which would 
include data processing required for the personnel, 
timekeeping, payroll, accounting, treasury, and labor 
relations departments. Another example is the material 
handling function which might include some data- 
handling problems that originate in the purchasing, re- 
ceiving, storing, accounts payable and manufacturing 
areas of the company. 

These processes are highly compatible to machines, 
and are generally the forte of the data-processing service 
center. Most of the applications being processed on elec- 
tronic data-processing equipment are, at least for the 
moment, in these functional reporting areas. The new 
data available to middle management are largely a by- 
product of these processes. 

First-level management and operating reports are 


often the stepchild of our modern high-speed data- 
processing systems. 

The operating levels of management need reports that 
reflect yesterday’s results the first thing this morning. 
The need for corrective action is urgent and must be 
taken immediately if it is to be effective. The report for- 
mats must be easily understood by many people and 
the contents flagged so that a cursory scanning of the 
detail will reveal troubled areas. The volume of data 
required is often large, as these reports are prepared for 
the control of detailed operations. A relatively low level 
of accuracy may be tolerated. A daily cycle and sim- 
plicity of format are the important considerations. Data 
collected in or close to the area being served are normally 
the source of these reports. 

Format, collection systems and methods need to be 
individually tailored to the department served as the 
manufacturing or service processes they are designed to 
control usually differ within each department. 

The data-processing center is rarely equipped with 
either the machines or manpower to cope with this cate- 
gory of reporting. Yet, the data needed for these reports 
are usually the same data that are required for the func- 
tional reporting category. 

In order to promote efficient, practical, and economi- 
cal integrated processing, the data-processing center 
must maintain an influential position regarding the de- 
termination of data-handling procedures within the 
company. The quality and cost of data processing for 
the functional and middle-management categories of 
reporting will be dependent on the center’s success in 
establishing informational pipelines into all depart- 
ments and the development of uniform and standardized 
methods of operation. The center will need to maintain 
staffs of intelligent personnel trained in data-processing 
methods in order to set up and operate the sophisticated 
procedures that will be required. 

Experience indicates that even this is not enough. 
Only when machine techniques are combined with prac- 
tical operating experience can integrated results be suc- 
cessfully achieved. The establishment of informational 
pipelines is only useful when they can be carefully main- 
tained and vigilantly guarded. Sophisticated procedure 
may be satisfactory for the highly-skilled machine tech- 
nician but breaks down rapidly when entrusted to less- 
informed employees. 

It appears that we may have reached an impasse on 
our trail towards the integration of data processing. 

The requirements of the data-processing center are, 
in many respects, in conflict with aspirations of the de- 
partment being served. The forcing of informational 
pipelines and procedural restriction on other depart- 
ments tends to create awkward human relations prob- 
lems. 

The various departments in need of “on the spot” re- 
porting to keep their first-line management informed 
will be inclined to develop their own methods which will 
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compete for data and make standardizing of data-han- 
dling procedures difficult. Arranging these parts to 
work together is a challenging problem. 

The proposed solution is not an easy way out. How- 
ever, where the suggested type of organization has been 
established by managers who were aware of the many 
incompatible aspects of the problem and who have had 
a sincere desire to promote efficient methods of data 
processing, it has been quite successful. 

The proposed plan assumes the following: 

1) A centralized concept of data processing has been 
established, at least for the use of major equip- 
ment items. 

2) Integration of the various data processes is con- 
sidered advisable. 

3) The central data service is equipped to take 
proper care of most of the functional and middle- 
management reporting and record-keeping re- 
quirements. 

4) The center reports to a level of management that 
is influential in all departments of the organiza- 
tion. 

5) A capable staff of analysts and programmers is 
available. 

6) Initial applications on major equipment have 
been satisfactorily installed. 

The major feature of the proposed organizational 
plan is an outside (of the center) operation designed to 
provide individual service to the using departments 
and provide for the informational pipelines required by 
the center. Another way of describing these service 
groups would be branch or satellite data-processing op- 
erations. 

The satellite operation may be as large or small as re- 
quired to provide for the needs of the department being 
served and may operate minor data-processing equip- 
ment. The equipment may range from paper-tape- 
equipped adding machines, typewriters, and bookkeep- 
ing machines, to small punched-card installations or, 
where justified, small-scale electronic machines. 

Normally, these satellite operations will be located 
in the using departments in order to establish an at- 
mosphere conducive to the wedding of technical ma- 
chine-processing skills with the departmental experi- 
ence. 

The supervisor of the satellite operator will report 
directly to the data-processing center. However, he will 
be dedicated to serving the department manager to 
whom he is assigned and must be approved by the de- 
partment manager. It is expected that this operation 
will be staffed by a mixture of personnel drawn from 
both the data-processing center and the using depart- 
ments. 

The responsibility of this grbup will, in addition to 
serving the using department, also have the secondary 
responsibility for establishing and maintaining the pipe- 


lines necessary to provide the data-processing center 
with the data that are originated or perpetuated by the 
using department. It is further expected that as the 
satellite group acquires the proper skills they will as- 
sume an active role in preparing suggested methods and 
procedure for the approval of the department head and, 
when approved, assist in their implementation. 

Although the size of the satellite operation and the 
manner in which it is equipped will vary with the needs 
of each department, the duties will remain the same. 
Implementing the plan will require a clear understand- 
ing of the objectives and duties of each satellite group. 
A letter or memo signed by the data-processing man- 
ager and the department head is suggested in order to 
be certain that the operating ground rules are firmly 
established. These rules may be expanded as the group 
gains familiarity with the area. However, the line of re- 
porting must be to the data-processing center if maxi- 
mum benefits are to be attained. 

Organization within the data-processing center is 
flexible. However, as the number of satellite operations 
increases, it may be advisable to appoint a supervisor 
over these operations to insure that expected standards 
of service are maintained and to coordinate the pro- 
cedural and data-flow activities. 

Several problems in human relations are apparent in 
this proposal, such as the acceptance of the satellite 
operation into a department, the divided responsibilities 
of the supervisor, and the relationship between the de- 
partment head and the data-processing manager. Diffi- 
culties may be expected in this regard, but experience 
indicates that wherever a sincere effort is made to over- 
come these difficulties, they are less serious than those 
generated by other types of organization. The proposal 
is often questioned from the standpoint of the utiliza- 
tion of equipment and manpower. This factor, if present 
at all, is normally offset by superior service rendered 
the using department. The data-processing center will 
benefit by receiving preprocessed data under carefully 
administered controls and in presummarized form. 

The most serious problem will prove to be in ade- 
quately manning the satellite operation with qualified 
personnel. Most often the success of this plan will be 
reflected in the ability of the appointed supervisor. 

The establishment of these satellite data processing 
groups provides an organization which may well serve 
to overcome most of the day-to-day problems con- 
nected with integrated data processing. Its many bene- 
fits include: 

1) Providing the using department with a specialized 
data-processing service. When properly equipped, 
this group can prepare records and reports tailored 
to the department’s requirements without seri- 
ously interfering with the schedule of the central 
data service. It also provides a vital service to the 
data-processing center by maintaining surveil- 



234 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


lance over the data-collection system that is so 
necessary to high-speed processing. 

2) Such an organization provides a coordination 
medium through which standards of operations, 
procedures and practices may be established. It 
also provides a natural working medium for the 
exchange of mechanical techniques and depart- 
mental operating experience. 

3) By being directly related to the service center, the 
satellite group is in a position to call for and get 
expert service and advice from the service center. 
It may borrow a keypunch operator to relieve a 
temporary situation, obtain an experienced op- 
erator if required, or call in an analyst to assist 
them. The shifting of temporary overloads to the 
service centers standby equipment is facilitated 
and it should be noted that the center will have a 
built-in additional capacity for weekend or emer- 
gency work. 

4) The proposed arrangement is useful in establish- 
ing control over data-processing equipment to be 
used within the company. 

5) It allows departmental management to concen- 
trate on their prime objective with the assurance 
that records vital to the department’s functions 
will be properly maintained. The system analysis 


group within the service center is provided a 
means of reviewing departmental reports to assure 
themselves and the company that costly and un- 
necessarily redundant reports are not being main- 
tained. 

6) This plan also provides a means for giving the 
technically-trained employee management and 
administrative experience which will help to as- 
sure a pool of prospective management talent. The 
service center will be benefited as this plan pro- 
vides a line of advancement for their superior em- 
ployees. 

Precedents for the proposed plan are not too difficult 
to find. For example: The timekeeping organization, al- 
though reporting to one authority, provides services to 
many areas. Transportation units often operate from 
central pools while maintaining specialized services in 
remote departments. Purchasing normally concentrated 
in a center provides associated operations to service 
outlying branches. In fact, wherever overlapping serv- 
ices are a requirement, such organizations are not un- 
common. 

It seems probable that recently devised equipment 
combined with advanced techniques and organized in 
this manner will permit another step toward the goal 
of truly integrated data processing. 


Developing a Long-Range Plan for Corporate Methods 
and the Dependence on Electronic Data Processing 

NORMAN J. REAMf 


Introduction 

I HAVE been asked to speak to you on the subject of 
the impact of electronic data-processing innovations 
on corporate systems planning. This subject matter 
could be a recitation of how we have approached our 
planning effort at Lockheed followed by a recitation of 
how it has been adjusted from time to time by innova- 
tions announced by various manufacturers of electronic 
data processing equipment. 

However, I feel that this subject can best be ap- 
proached by first spelling out some of the major prob- 
lems facing all industry, pointing up some areas that are 
lacking in development. Then I shall attempt to discuss 
what appears to be a logical approach to these difficult 
management problems, what contributions electronic 

f Lockheed Aircraft Corp., Burbank, Calif. 


data processing has made to date, and what contribu- 
tions future innovations in electronic data processing 
will or will not make to the solution of this multitude of 
problems. 

While my remarks are directed to a corporate admin- 
istrative systems planning effort, we all realize that the 
subjects that are under discussion at this conference 
have broad social significance and we must stand ready 
to assume our responsibilities. 

Our economic system is designed in a manner in which 
a majority of the decisions affecting it are made by 
thousands of independent managements. This is an ad- 
vantage to our country and to industry, but it also 
poses heavy responsibilities on the shoulders of members 
of management. As Americans, we are convinced that 
this freedom of action awarded these managements will, 
when working within the proper social framework and 
business environment, result in the greatest good for our 
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citizenry. As a nation, we have flourished under the con- 
cept of individual freedom and I believe that it holds the 
only promise for our future. 

You are all aware that at the present time we are en- 
gaged in an economic “cold war.” Khrushchev has pub- 
licly announced that it is the Russian intention to bring 
the United States to its knees by means of economic war. 
He has also announced that at the end of their latest 
seven-year plan, Russia will have surpassed the United 
States in production and that the living standard of the 
average Russian will be better than we Americans now 
experience or will be experiencing at that time. 

Fantastic claims? Perhaps, but we are engaged in a 
life or death struggle and the boasted intent of our eco- 
nomic adversary must be taken seriously, for these 
same intentions were first announced by Karl Marx over 
one hundred years ago. Therefore, the seriousness of 
our responsibility cannot be overstated. We must seek 
ways and means of increasing our over-all productivity 
and continuing the elevation of our living standards. 

To maintain our cherished world position ahead of 
other world political doctrines will require that the 
thousands of independent managements in our country 
provide initiative and leadership in the use of the re- 
sources at their command. They are charged with the 
responsibility of securing greater internal efficiencies 
within their individual organizations and at the same 
time maintaining and improving their satisfactory rela- 
tionships with employees, customers, stockholders, and 
others. 

Drucker 1 stated in a recent article that one of the 
major problems of business is “the lack of any bridge of 
understanding between the ‘macro-economics’ of an 
economy and the ‘micro-economics’ of the most im- 
portant actor of this economy, the business enterprise.” 
He said that the only micro-economic concept to be 
found today in our economic theory is that of profit 
maximization which may mean short-run immediate rev- 
enue or long-range basic profitability of wealth-produc- 
ing resources that may have to be qualified by a host of 
unpredictables, such as managerial power drives, union 
pressures, technology, etc. But this fails to account for 
business behavior in a growing economy. 

According to Drucker, profit maximization is the 
wrong concept. The relevant question is: “What mini- 
mum does a business need?” not “What maximum can 
it make?” Companies that have attempted to think 
through the risks of business have found that the sur- 
vival minimum exceeded the present “maxima” in 
many cases. 

Drucker further pointed out that another crying need 
is the development of an integrated organization. 
Twenty years ago it was possible to see a business enter- 
prise as a mechanical assemblage of functions, but today 
we know that when we talk of business, functions 

1 P. F. Drucker, “Business objectives and survival needs,” J. 
Business, Univ. Chicago Press, Chicago, 111., vol. 31, pp. 81-90; 
April, 1958. 


simply do not exist. We speak of business profit, risk, 
product, investment, and customer relationships.' The 
functions are irrelevant to any of them, yet we also 
recognize that work has to be done by people who spe- 
cialize because no one knows thoroughly the ins and outs 
of a given function today, let alone all functions of a 
business. A basic problem then is how to transmute 
functional knowledge and functional contribution into 
general direction and profitable general results. 

To my knowledge the problem of integration has es- 
caped solution and will only be answered by devoted re- 
search and development. This will not be easy for we 
must recognize we are faced with the problem of de- 
veloping a means of measuring and controlling a com- 
plex assortment of interacting groups of variously moti- 
vated entities in a flux of decision-making situations 
that comprise a normal company. The degree of the 
complexity involved is usually in direct proportion to 
the size of the organization. 

Responsibilities of Management 

Let us turn for a moment to a discussion of the re- 
sponsibilities of management, for certainly the admin- 
istrative systems planning effort of any company must 
be addressed to the responsibilities of management. 

In the broadest sense we could define the responsi- 
bilities of management as the guidance, leadership, and 
control of a group of individuals toward a common ob- 
jective. This broad definition indicates a purpose, but 
fails to give us an insight of how results are obtained. 
Therefore, it is necessary to define the responsibilities of 
management by defining their five basic processes: 

1) Planning — that is, determining what shall be done. 
As used here, planning covers a wide range of de- 
cisions, including the clarification of objectives, 
establishment of policies, establishment of pro- 
grams, and determining specific methods and pro- 
cedures. 

2) Organizing— or grouping the activities necessary 
to carry out the plans into management units and 
defining the relationships among the executives 
and workers in such units. 

3) Assembling resources— that is, obtaining for the 
use of the business the personnel, capital, facilities, 
and other things needed to execute the established 
plans. 

4) Directing — -i.e., issuing management directives. 
This includes the vital matter of indicating plans 
to those who are responsible for carrying them out. 

5) Controlling— or seeing that operating results con- 
form as nearly as possible to the established plans. 
This involves the establishment of standards, 
motivation of people to achieve these standards, 
comparison of actual results against the prede- 
termined standard, and initiating necessary cor- 
rective action when performance deviates from the 
plan. 
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, All management engages in the processes I have 
enumerated, and it is clear that various individuals who 
comprise management spend varying amounts of time 
at each. 

The different members of management are divided 
into their functional specialties, such as sales, research 
and development, engineering, manufacturing, finance, 
industrial relations, etc. Each of these may in turn be 
divided into the five basic processes of management pre- 
viously enumerated. These are then established as two 
different approaches to the same management activities. 
For example, the vice-president of engineering must 
plan, organize, assemble resources, and direct and con- 
trol in the same manner as any other member of man- 
agement. His problems may differ in degree, but they 
are interrelated and interdependent upon those prob- 
lems of the balance of the management echelons. 

The increase in the scope and complexity of modern- 
day business has resulted in management recognition 
of the necessity of further development and increased 
use of scientific management techniques. Management 
recognizes that our simplifying processes must move 
forward in balance with business complicating processes. 
Unless our simplifying processes keep pace, we will be- 
come a casualty of our self-developed economic and 
business complexity. 

The next few years will see a tremendous increase in 
the use of data-processing systems in the development 
of new management methods; however, I wish to em- 
phasize that in my opinion they will only be a tool of 
management in the over-all improvement of the man- 
agement abilities. 

Objectives of Corporate Systems Planning 

In any corporate systems planning effort, it is axio- 
matic that we direct our attentions to the problem of 
determining what information is required to operate 
business in a coordinated and profitable manner. 

Good communication aids in coordinating the known 
activities of management. For instance, management 
must know promptly whether operations are proceeding 
in accordance with plans so that adjustments can be 
made when required. Moreover, there are a wide variety 
of activities, particularly those of a detailed nature, 
that are impractical to plan too far in advance, and 
coordination of these is achieved only when the per- 
sonnel directing and performing them have current in- 
formation upon which to base decisions. 

Management has awakened to the realization that a 
business is essentially controlled and directed by de- 
cisions based on information supplied by its data-proc- 
essing system. It has realized that major policy evolves 
from a whole series of day-to-day decisions based on the 
information currently at hand. And it has awakened to 
the bare fact that much desirable information is not 
available. 

Further, management is realizing that it is much 


simpler to set down the logic of problems in the field of 
physical sciences than it is to set down the assumed 
logic of a business executive when he is making a de- 
cision based on incomplete data. 

Having established that the communication of essen- 
tial information within a company is paramount in de- 
veloping an integrated approach to management prob- 
lems, it is necessary that we turn our attention to the 
administrative systems that are used to supply manage- 
ment intelligence. 

The basic objectives in the development of an inte- 
grated approach to administrative systems are: 

1) Development of improved management intelli- 
gence for use in decision-making processes. 

2) The reduction and control of time spans. 

3) Improved accuracy. 

4) Increased productivity. 

5) Reduced costs of operation. 

None of these basic objectives are new, but the advent 
of electronic data-processing systems has given new life 
to this whole area. We are now and will be seeking the 
proper application of data-processing devices in order to 
take full advantage of interrelationships between the 
data problems of various segments of management and 
to recognize appropriately the dependence of a number 
of these segments on the same basic input information. 

Planned Improvement 

In attempting to devise a planned administrative 
systems improvement program for Lockheed, we recog- 
nized we could not realize the desired results merely by 
studying, appraising, and converting our existing sys- 
tems and procedures. We were faced with a research 
problem of considerable magnitude, and it could only be 
solved by an analysis of the requirements based upon a 
knowledge of systems parameters. 

We found we had to solve this logical problem: How 
can we best take the basic information from our day-to-day 
operations and process and distribute it so as to maximize 
our profits and minimize our costs? The problem re- 
quired that we devise a well-planned program to be car- 
ried out by creative people acquainted with research 
methods. 

The most difficult part of any complicated problem, 
whether administrative or scientific, is the devising of a 
clear formulation and the establishment of a systematic 
manner of proceeding. The administrative-systems prob- 
lem being primarily concerned with the processing and 
flow of information has three basic parts: 

1) Formulation of the problem — We must determine 
here what inputs are required and what outputs 
are required. 

2) Logical design — dn this part of the study, we set 
up the internal relationships and describe the de- 
tailed information flow so given inputs will pro- 
duce required outputs. 
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3) Detailed systems design — This part is concerned 
with the techniques and the tools of the system 
and spells out how the operations required in 2) 
will be accomplished. 

We found that we had to attack these in their order 
(although at times it is possible to accomplish some of 
the logical design and the detailed system design in par- 
allel). 

Again, I emphasize that the most difficult area is the 
proper formulation of the problem. Until this is ac- 
complished, little return can be expected in attempting 
to attack small pieces of the existing system. 

The basic requirement is to tie the various segments of 
management into an effective whole through proper 
flow of information. It is binding the entire organiza- 
tion together into an effective, integrated whole through 
this flow of information that permits the information 
pipelines to serve not only as a means of improving day- 
to-day operations, but the projected operations as well. 

Regarding this as a logical problem we realized that 
it is not necessary that each person working on the for- 
mulation of the problem or the logical design have a de- 
tailed knowledge of the existing systems, accounting, 
manufacturing control, etc. The actual requirement is 
an ability to use research techniques and an objective 
attitude which will not necessarily be influenced by ex- 
isting administrative systems. 

Effect of Electronic Data-Processing 
Innovations 

In attempting to discuss the effect of equipment in- 
novations on the systems planning effort of a corpora- 
tion, I would like to preface my comments by stating 
that in my opinion industry in general has not learned 
to use efficiently the logical abilities of existing data 
processing systems. I do not mean to discount the tre- 
mendous contribution that electronic data processing 
systems have made to management, but I wish to em- 
phasize that in my opinion, we are only on the frontiers 
of exploiting their potential values. 

Certainly the advent of solid-state systems will in- 
crease considerably the over-all productivity of equip- 
ment systems and will improve their reliability. How- 
ever, until we improve our ability to use the logical de- 
signs of such new systems, we shall be using them as in- 
efficiently as we are using existing systems, and our 
costs will undoubtedly be proportionately higher than 
they should be. 

Since the delivery of the first Univac system to the 
Census Bureau in March, 1951, we have seen tremen- 
dous strides in the development of equipment systems. 
For instance, in the 700 series we have seen the 702 and 
then the successive introduction of multiple versions of 
the 705. Unfortunately, companies that have followed 
the trend of accepting all equipment changes that have 
come down the pike have done so at a tremendous cost. 


I believe changes of electronic data processing systems 
must be dictated by management’s ability to use them 
efficiently after all costs have been considered. Too 
many times equipment changes have been made without 
recognizing the multiple of such hidden costs as site 
preparation, turn-around costs, reprogramming effort, 
etc. A word of caution — evaluate any proposed equip- 
ment change carefully. 

With the rapid introduction of more advanced data- 
processing systems by various manufacturers, many 
managements are utterly confused and are continually 
pressed to answer the supposed problem of equipment 
obsolescence. It seems to me that equipment obsoles- 
cence falls into four categories: 

1) Economically obsolete — It would be beneficial eco- 
nomically to discard equipment and replace it 
with equipment of more advanced design. 

2) Technically obsolete — Better equipment is im- 
mediately available. It is not warranted to discard 
presently installed equipment, but if new equip- 
ment were being acquired, it would be more feas- 
ible to acquire the latest design. 

3) Technically obsolescent — Better equipment is in 
the design stage, but is not presently available. 

4) Conceptually obsolete — Better equipment is theo- 
retically possible, but has not yet been designed. 

If you will reflect for a moment you will recognize 
the four stages of obsolescence and realize that all 
equipments proceed from the stage of being conceptually 
obsolete to the point of being economically obsolete. 
However, the point of becoming economically obsolete 
is usually a long time after they become technically 
obsolete, because the marginal cost of operating and 
maintaining them is small. If we were in a detailed 
discussion of leased or purchased equipment, then we 
would have to concede that when equipments are being 
leased, the point at which they reach economic obso- 
lescence is much earlier than when they are purchased. 

There are two major reasons for a change in elec- 
tronic data-processing systems: 

1) It is desirable to decrease the unit cost, or 

2) It is desirable to do jobs that are not possible of 
accomplishment on existing equipment. 

One of these two must be the basic reason for change to 
equipments of increased speed or increased memory 
capacity. 

The physical difficulties as well as the economic prob- 
lems of changing equipment, both for the manufacturer 
and the user, acts as an inertia or a brake on the intro- 
duction of too many new models, each being a slight 
improvement over the old. In my opinion, an order of 
magnitude of three to five times in improvement is 
needed before a going machine can economically be 
supplanted. 

Because of the long cycles and lead times involved in 
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the development in the new data-processing systems, 
there is an upper limit to the speed with which new 
electronic data-processing systems can be introduced 
by any one manufacturer. However, as additional man- 
ufacturers start at varying places on the time scale, 
the result for the consumer is a practically continuous 
curve of increased technological advantage. The man- 
agement problem then becomes one of choosing at 
which point to make the change and of assessing the re- 
liability of the various manufacturers’ claims. 

Another factor that must be considered is whether or 
not the user of the proposed new electronic data process- 
ing system actually has a fully developed requirement. 
In other words, you can find operations where electronic 
data processing systems have been changed only to 
learn that they have acquired too large a system for the 
amount of work to be processed. 

To date, most of the attention of management has 
been directed towards gathering historical information 
and not too much progress has been made in the area of 
development of management information for determi- 
nation of a company’s objectives. The advent of elec- 
tronic data processing systems makes possible the ap- 
plication of techniques that had hitherto been impossi- 
ble by manual or electromechanical means. We can 
look forward to great strides in this area in the next 
few years in the development and application of mathe- 
matical techniques (operations research) to the prob- 
lems of management. 

The area of financial planning and control is wide 
open, for example, in the applications of scientific man- 
agement techniques. I am sure that electronic data-proc- 
essing equipment will be a major contributor to the 
eventual solution of many existing problems. However, 
the equipment will only be a means of developing an 
answer after we have determined what information is 
required and how it can be developed. Here again, I 
would like to emphasize that progress will not come 
about merely by having the equipment, but must be 
preceded by a fully developed definition of the problem 
and the logical design of the system required. 

In my initial work in the financial forecasting area, the 
first unforeseen difficulty was the problem of language. 
As a simple example, discussion with people in financial 
operations activities revealed instances of several dis- 
tinct concepts answering to the same name and a multi- 
plicity of names applied to essentially one concept. As a 
result, it was considered necessary to establish for the 
accounting aspects of the problem a structural frame- 
work which was reasonably complete, self-consistent, 
and within which the problem could be formulated with 
some assurance that the terms employed were clearly 
defined. Eventually it was determined that the problem 
could be solved in four steps. First, obtaining a mathe- 
matical model for the entire accounting feature of the 
forecast. Second, a study of prediction techniques used 
in obtaining basic information for the various inputs to 


the accounting structure. Third, a study of the pro- 
cedures necessary to make certain quantities of the out- 
put a maximum or minimum, while others are held 
under specified restrictions. Fourth, the application of 
high-speed data processing equipment to the results of 
the three previous studies. 

I might add that work in these areas does not lend it- 
self to rapid solution but is a result of long and arduous 
effort on the part of persons who are dedicated to re- 
searching this type of management problem. 

Information retrieval is another problem area that 
holds promise of solution. However, data processing 
equipment cannot make a contribution until such time 
as the problem is defined and a logical retrieval system is 
designed. 

These latter two are only representative of a host of 
management problems that must be sought out and 
solved to insure eventual management survival. 

Selling Electronic Data Processing 

Those of you who are in the data-processing area of 
your management are faced with a real challenge to sell 
your ideas to management as well as to employees. You 
not only have the problem of selling the use of electronic 
data-processing equipment, but you ultimately will have 
the responsibility of proving that the introduction of 
these systems has reached or exceeded the break-even 
point. The break-even point of electronic data-process- 
ing systems is still most dubious; in fact, many existing 
installations are not economically sound. You are faced 
with the challenge of the control of costs after installa- 
tion and of the continuous effort of insuring that the 
programming effort is of prime efficiency. 

Organization Change Required 

One of the most difficult problems which faces the ad- 
vocates of the use of electronic data-processing systems 
is to awaken managements to their responsibility of in- 
suring that they are using the systems to their best abil- 
ities. I agree that computers have been the impetus be- 
hind the tremendous interest that is now being focused 
on the development of improved administrative sys- 
tems, but at the same time most managements have 
failed to recognize the necessity of integrating their or- 
ganization in a systematic and purposeful manner. 
Usually these changes just happen over a period of time, 
arise from temporary expediency, or emerge as a solu- 
tion to a crisis situation. However, in most instances 
functional reorganization has not kept abreast of the 
technological change. 

The management organization structure is not invio- 
late and should be treated accordingly. In many cases 
changes in the organization structure can eliminate 
many of the procedures that complicate an administra- 
tive system, thus reducing costs and contributing to the 
simplifying process. 
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Consolidation of Scientific and Administrative 
Electronic Data-Processing Operations 

One of the most interesting and promising develop- 
ments in the application of electronic data-processing 
systems in business is the apparent determination of 
most manufacturers to develop a data-processing sys- 
tem that can be used for both business and scientific 
data processing with equal effectiveness. Also, the trend 
towards the development of these all-purpose data- 
processing systems in the medium scale field, with an 
ability to expand, opens the door for the economic use 
for computers in many smaller industries. It also gives 
good reason to speculate that large companies will find 
that the use of these dual-purpose systems will decrease 
considerably their over-all data-processing systems 
rental costs by consolidation of scientific and business 
data-processing operations and maximizing available 
computer time. 

The establishment of two definite data-processing 
groups within a given area of a business to handle inde- 
pendently scientific and administrative data-processing 
operations is a very costly venture. The now recognized 
means to handle many independently programmed op- 
erations simultaneously, that is, business, scientific, or 
both, eliminates many of the arguments that were for- 
merly used to substantiate the independence of the two 
establishments referred to above. From the standpoint 
of the future economics of data-processing systems in- 
stallation, I believe that this is a major step forward. 

Pre-Announcement of Equipments 

One of the most disturbing factors in the application 
of electronic data-processing equipments to manage- 
ment problems is the continued practice of some manu- 
facturers of announcing new equipment innovations far 
in advance of their actual design completion or availa- 
bility. I think that manufacturers would serve their 
purpose well if they would withhold announcements of 
new equipment until such time as they were able to dis- 
cuss these new equipment systems with the support of 
factual information. 

I recognize that the industry is extremely competitive 
and that various companies are jockeying for the best 
possible position. However, I believe that many of 
their sales practices have done much to retard progress 
by throwing many managements into a state of utter 
confusion by moving equipments into a business enter- 
prise long before the planned installation is ready for 
their use. 

I know of instances where certain manufacturers 
have attempted to have management withhold deci- 
sion on equipment in order to deliver their own new 
system many months in the future when full economic 
justification of immediately available equipments has 
been documented. Also, some manufacturers’ sales ef- 
forts have attempted to replace a competitor’s equip- 


ment with their own equipments when there is no eco- 
nomic justification to the user. 

Usually the sales tactic behind these equipment pre- 
announcements has been to prove to a potential cus- 
tomer that the manufacturer in question has a corner 
on the knowledge and abilities in the data-processing 
equipment field. In my opinion, there are many reliable 
manufacturers and I know of no company that has a 
corner in the area of development or know-how. 

Lack of Development in the Input Area 

Just as in the usage of punched-card equipments, the 
weakest area of electronic data-processing systems de- 
velopment today is in the input areas, i.e., the collection 
of input data at the point of origin. It is true that this 
area is one of the most difficult to define — and perhaps 
has been side-stepped by both the users and the manu- 
facturers because of this. Yet, I have the feeling that 
some manufacturers have long recognized the problem 
but have been more interested in protecting their invest- 
ment in equipments and card plants than in making a 
definite contribution to the state of the art. 

Electronic Data-Processing Systems Contracts 

I would also like to comment briefly on the present 
type of contract normally available for the lessee and 
the lessor of the equipments. Up to a short time ago, 
the rental contracts were certainly all vague and mean- 
ingless and were written solely for the benefit of the 
lessor. However, recently there has been a trend on the 
part of some lessors — that is, the manufacturers of the 
equipment — to write more definitive contracts than were 
previously available to the user. Even these are not 
definitive enough. It is time for the users to demand an 
even more definitive contract, one that charges the 
manufacturers with performance responsibility. I can 
see no reason why any manufacturer should make fan- 
tastic performance claims for given equipments during 
his sales efforts and then fail, if they are valid claims, to 
put these into a leasing agreement. Refusal only means 
that the abilities of the equipments have been over- 
stated. 

The present type of contract, as I previously stated, 
is a lessor’s contract, and I believe that its continued 
use through the years has caused the user to absorb 
many costs due to equipment malfunction that should 
have rightfully been borne by the manufacturer. 

Development of Programming Techniques 

Charges have been levelled at the manufacturers of 
data-processing equipment that they have failed to de- 
velop means of using the logical ability of existing sys- 
tems to the fullest extent through neglecting to furnish 
advanced programming aids and techniques. To a de- 
gree it is a valid criticism. However, I feel that this is 
buck-passing, and to date our failure to exploit more 
fully the use of presently available data-processing 
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equipment in business is caused by the lack of the de- 
velopment of logical systems that will fully exploit the 
logical abilities of the equipment available. Too much 
effort has been expended on trying to transfer available 
human systems to these equipments rather than at- 
tempting to develop a proper definition of the logical 
system required. 

Conclusion 

Electronic data processing is becoming a byword in 
the evolution of the techniques of a scientific approach 
to the problems of management. The equipments are 
not an end in themselves and cannot be considered a 
panacea for the ills of management. Rather they are a 
tool of management. Their contribution to the improve- 
ment of management is entirely dependent on how well 
the problems of management are defined by the indi- 


vidual practitioners and how ingenious they are at de- 
veloping means of formulating information for manage- 
ment review and decision-making processes. 

Again, we are only on the frontiers of the potential 
we seek. Hard work and ingenuity will bring success. 

We must move carefully and at all times must be in a 
position to justify our activities. The introduction of 
improved electronic data-processing systems will un- 
doubtedly contribute to the advancement of the state 
of the art, but the feeding of bad inputs into faster and 
more capable equipments will only generate more bad 
information at a faster pace. 

I have tried to point out some of the difficult areas 
we are encountering to avoid overoptimism. Yet, there 
is no reason to be overpessimistic. Our eventual goal 
can be attained, and with the high stakes involved, the 
significance of the results warrants the all-out effort. 


A General Approach to Planning for Manage- 
ment Use of EDPM Equipment 

GOMER H. REDMOND t 


D URING the past decade we have all been aware 
of, or have played a part in, a maximum effort on 
the part of certain dedicated people. The objec- 
tive of this informal effort seemed to be to convert all 
clerical and computational work efforts into an auto- 
matic, “fire-all-the-clerks, we-can-do-anything” ap- 
proach to all known business practices. These dedicated 
people were mainly comprised of data-processing manu- 
facturers, management consultants, scientists, engineers, 
and administrative line and staff executives and assist- 
ants. They usually had a good grasp of a specific prob- 
lem and felt that this problem or series of problems were 
sound EDP applications and by extrapolation, pro- 
ceeded to teach a number of management executives the 
economics of EDPM installations. 

The influence of this effort and the span of time over 
which it took place were both beneficial. Decision- 
making management level executives encouraged and 
at the same time discouraged the advance of clerical 
automation. Equipment marketing people were forced 
to compete in areas that they frankly knew were not 
practical. “If I don’t submit a proposal for this applica- 
tion, competition will get the business and it’s a rough 
row getting back in,” were frequent comments that we 


t Chrysler Corp., Detroit, Mich. 


all have heard upon questioning EDP salesmen regard- 
ing doubtful or submarginal installation. 

Management consultants were asked into the clerical 
automation area by top executives who respected pre- 
vious neutral and objective work assignments, many of 
these management consultants accepted work assign- 
ments with the assumption that old and proven “stand- 
ard” techniques would serve their purposes in this area 
as they had in many others; others went in with good 
staffs and did a “down town” yeoman-like job. Still 
others capitalized on the confusion and helped equalize 
mass optimism by forecasting dire results if EDPM in- 
stallations were contemplated and planned without 
specific help from their firm. 

Not to be outstripped, many business and newspaper 
writers and reporters joined in and began to point out 
the miracles of electronic hardware and its possible 
effect upon American business organizations and opera- 
tions. Articles in magazines, in newspapers, in trade 
journals praised use of EDP equipment as a new indus- 
trial revolution and played up the hardware and its 
speed with almost no references to planning and applica- 
tion problems and costs. Other articles damned the rapid 
acquisition of EDP equipment. These write-ups blew 
way out of proportion some pioneering marginal com- 
mercial installations, threw serious doubt into manage- 
ment’s mind as to the capabilities of their EDP planning 
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people and condemned organizational structure for per- 
mitting such a cancerous growth to survive. 

Systems people and EAM operators also feeling a 
need to keep abreast were schooled in electronics. In 
many cases, they forgot traditional good systems con- 
cepts by tailoring work areas and good systems con- 
cepts to the convenience of a preselected “brain”; this 
was done to get into business as soon as possible. 

To sum this up I would like to quote two individuals 
who have different viewpoints and yet a mutual under- 
standing of this problem. John Diebold, President of 
John Diebold and Associates, in an address to the 
Eleventh International Management Congress said, 
“Automation has presented management with a major 
new problem. As yet management has not faced up to 
this problem and is hardly even grappling with it in any 
true sense. This is through no lack of energy or good in- 
tentions. On the contrary, the very activity of manage- 
ment in this sphere attests to the progressive spirit and 
desire for improvement that characterize the modern 
manager. The trouble lies elsewhere. Automation has 
turned out to be a much more complex and difficult 
problem than was originally thought. This being the 
case, the current disposition to minimize its revolution- 
ary and novel aspects is more hindrance than help in 
putting automation to work.” 

The other gentleman that I must quote, to success- 
fully set the stage for this general topic, is an EDP 
equipment manufacturing top executive. He is one of a 
half-dozen human beings who have successfully bridged 
the gap between an exacting technical knowledge of 
EDPM and the work that management should plan for 
this equipment. Just recently, he asked when American 
military, industrial and institutional management was 
going to use properly the equipment already produced 
and plan the right scientific and data-processing jobs 
for it to accomplish. He went on to point out that there 
is enough data handling equipment already produced to 
process all problems that our economy needs to handle 
— providing this equipment is properly programmed and 
properly distributed. Sound planning is the answer. 

The need for planning prior to major commitment in 
most endeavors is apparent. A football team plans so 
that all eleven men work together to achieve a first 
down or a touchdown. In business, profit planning, 
sales planning, product planning, and production plan- 
ning are present in most organizations and normally are 
predominant factors in successful achievement of ob- 
jectives. 

Formal planning has not been adopted for orderly 
consideration of electronic hardware by all business, 
military, and other forms of organizations. This is 
primarily due to the apparent desire of many functions 
of an organization to 1) jump on this bandwagon (or 
get left behind), 2) to control this monster which could 
completely dissolve current organization, 3) to merely 
extend current punch card applications through the 
electronic barrier, or 4) to ignore the entire subject 


until EDP has become a proven practical factor in other 
organizations. 

With the possibility of huge expenditures for pro- 
gramming, for building computer sites, for selection of 
appropriate systems applications, and for the hardware 
itself, an organization should begin to plan for success- 
ful planning, for it is essential in all forms of planning 
that a fundamental design and approach to planning a 
subject be created. Such a design for EDP planning 
must take into consideration scope, organization, and 
the ground rules or administration of a planning func- 
tion itself. 

To discuss this on a general basis, due to the number 
of extremely different forms and types of organizations 
represented here, let us first examine some possibilities 
of designing “scope” into an EDP planning function. 

Scope 

Planning latitude may be extremely broad if you are 
functionally responsible for data processing in a multi- 
plant industrial complex or it can be extremely limited 
if the characteristics of the enterprise are small and proc- 
essing requirements are simple. To my knowledge, there 
is no formula for determination of planning scope. Some 
questions may help in formulating this portion of your 
design for EDP planning: 

1) Should the plan point long range, at the immedi- 
ate, or both? What should the plan achieve? Are 
planning objectives understood by all concerned? 

2) Is the planning concerned with a part or all of a 
specific application? 

3) Should planning review present punch card appli- 
cations, present manual operations, or should it 
also encompass data processing for new manage- 
ment decision-making techniques? 

4) Is planning required for all portions of an indus- 
trial enterprise or are we only concerned with a 
division, a plant, or a single function at corporate, 
division, or plant level? 

5) Should planning be pointed toward data-process- 
ing economy or in specific work areas do accuracy 
and speed take precedence? 

The scope of planning cannot be completely deter- 
mined without a review of the planning organization 
and its place in the organization. 

Organization 

1) Where should the staff work of EDP planning 
take place in the organization? Is central planning 
required for product planning, for profit planning, 
etc? Can economies be gained by centralized plan- 
ning or are products and data-processing prob- 
lems so different that plant or divisional planning 
is essential? 

2) Can a combination centralized-decentralized form 
of planning work? Does this provide greater flexi- 
bility? 
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3) How should the planning group (s) be organized? 
What should the relationships be to systems em- 
ployees? 

4) What should be the make-up of the planning 
group or groups? How many staff workers should 
participate? What background, educational and 
experience levels should be utilized? 

5) Should committee action be employed? 

6) Should line employees contribute? 

7) What part, if any, should outside consultants per- 
form? 

8) Equipment manufacturers — should they be asked 
to contribute data or ideas during planning and 
decision making? 

After a proper organization is decided upon, manage- 
ment must decide upon a proper set of ground rules or 
administrative routine within which the EDP planning 
will operate. 

Planning Ground Rules 

1) Is planning to be specific or broad? Is the plan to 
be fixed or flexible? What happens to the plan if 
broad procedures or organizational patterns are 
changed during or after completion of the plan? 

2) How much documentation will be required? 
Should progress be periodically or occasionally re- 
ported, and if so, to what level or levels of manage- 
ment? 

3) What preplanning bench markers can be estab- 
lished to enable management measurement of 
planning progress? 

4) Should management decision be requested during, 
or only upon completion of, a review of the entire 
planning scope? 

5) In what detail should management participate 
during planning steps? What information must 
management have to make final decisions prop- 
erly- — all the detailed technical data or broad ap- 
proximations of cost and estimates of anticipated 
results? 

6) Should fact finding be accomplished by a planning 
staff or should all components contribute factual 
data? 

7) Is it necessary to sell staff work and EDP planning 
prior to the planning action? 

These question areas are representative of the many 
decisions that should be made before initiating a thor- 
ough corporate-wide planning survey, or in performing 
a feasibility study at division or plant level. The list 
of questions is not a complete check list, but may be 
helpful in creating a list for a specific organization. 

Planning work that can be measured and evaluated 
is difficult in itself, but when initially directed toward 
EDP, difficulties in the planning process may be ampli- 
fied due to communication problems and due to many 
organizations’ inherent caution and fear of possible 
changes. 


Planning work and staff work are not new to most 
American military men or to business firms. An im- 
portant contribution to the acceptance of a complex 
program is the attainment of prior success to create an 
aura of reliability in those executives that have so per- 
formed. Therefore, design for the plan is especially im- 
portant during an initial planning process; in many 
cases the “chips” are great, and as in baseball, the bat- 
ter either gets on base or he is out. In many more cases, 
however, successful use of EAM equipment plays a 
significant role in preparing for formal planning. 

At Chrysler our planning had to be initiated by super- 
imposing a fact-finding and investigatory stage over an 
existing broad and, in many cases, extremely good data- 
processing system. This system utilized medium and 
large EDP equipment along with EAM equipment at 
various levels of organization. It was largely created by 
breaking off an existing installation (manpower and 
programs) and, for geographic purposes, a new installa- 
tion was then originated. A good amount of systems 
work had been accomplished, but this area needed ex- 
tensive work to restrengthen approach concepts as well 
as to institute much-needed machine room documenta- 
tion, administration, and control. A total of 42 machine 
rooms were in existence with over 1200 employees work- 
ing on data-processing equipment. The comptroller was 
then informed of the need for a formal plan to investi- 
gate work areas and techniques ; this review would then 
expand into an over-all systems study that envisioned 
examination of source data pickup to finished manage- 
ment action or information reports. He agreed. 

EDP planning at Chrysler is the staff responsibility of 
the Manager, Systems, and Procedures. Data processing 
is an important management instrument in our com- 
pany, and we have tailored our systems and procedures 
organization to place proper emphasis and yet not over- 
emphasis upon the subject of data processing. Our man- 
agement believes that a correct balance between the 
work and programming effort of good systems concepts 
planning and mechanical and EDP planning people 
must prevail. 

An Assistant Manager, Corporate Systems and Pro- 
cedures, has been delegated the specific and full-time 
responsibility of EDP and EAM planning and control. 
His organization comprises three sections, each headed 
by a supervisor; they include 1) Operations Research, 
2) EDP and EAM Administration and Control, and 3) 
Planning. Our Planning Section is made up of eight em- 
ployees who have between 3 and 8 years of practical op- 
erating and staff experience in this specific field. Most 
have college degrees in mathematics, in business, or in 
engineering; however, some of our finest staff work has 
been accomplished by personnel with much less formal 
education. 

Initially our planning was purposely geared on a 
broad basis to securing factual data and, on the first 
pass, to point out specific areas in which immediate new 
or remedial work should be initiated. Our preliminary 
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plan was half completed when we discovered a need; a 
need for a better way to handle those jobs (in a large 
industrial complex) that due to low unit costs, minimum 
elapsed time requirements, and with little additional 
communication costs, made a large computer a natural. 

As in any flexible planning technique, new base lines 
and facts were developed, and after proper delineation 
and documentation of the new computer’s planning and 
operating organization, the over- all plan was reinstated. 
You will note that specific and detailed operating plan- 
ning, after a management decision has been made, is 
separated from the staff planning work and in effect pro- 
ceeds through its own planning cycle within a specific 
predetermined scope and time limitation. 

The broad-gauge, corporate-wide planning is then 
continued with new factors and perhaps with new 
ground rules that have resulted from the major decision 
passed down by the top management. Continuation does 
not mean starting over, but it does embrace a review of 
the planning process to date. This planning process en- 
tails 1) reaffirmation of objectives, 2) analysis of the 
new situation, 3) determination of planning routes and 
manpower requirements, 4) choice of alternate course 
to be initiated, 5) conversion of choice into action steps, 

6) creation of internal and external communication 
channels, 7) determination of planning and methods of 
appraisal, and 8) then a procession through these same 
steps after evaluations indicate that changes in the 
planning process should be incorporated. 

Specifically, our current plan which will be presented 
again to top management in the near future has the fol- 
lowing objective. 

It is the objective of the Corporate Systems and Procedures De- 
partment, through the development and eventual approval and 
adoption of this and succeeding planning efforts, to provide Chrysler 
Corporation with the most effective management information sys- 
tems in the Industry. 

The need for constant and continual planning is a fundamental 
requirement to successfully integrate, coordinate and effect such a 
dynamic data handling system. Competitively this is a must for 
Chrysler. 

In general, we have just scratched the surface in the data handling 
field with the mechanization of some of the “safe” clerical functions. 
These functions have provided valuable experience in data processing 
techniques. On occasion, we have used the logical abilities of data 
handling equipment to provide meaningful data for the control and 
regulation of manufacturing processes and to directly help to improve 
our product quality. However, at the present time most of the data 
generated merely reports the status of operations, produces necessary 
checks and notices, but is rarely employed to provide recommended 
actions for timely operational decisions. Ultimately, a data handling 
system will be evolved that will provide this needed decision-making 
information, and select action courses for more consistent and precise 
control at all levels in the organization. The program to effect this 
plan will not remain static for any long period of time, but must be 
continually reviewed to reflect new data processing product develop- 
ment, new systems concepts, and progressive changes within the 
corporation. 

After formally stating a purpose, it is frequently nec- 
essary to state the base line, or present situation, which 
helps form the scope and direction of our planning in 
the minds of the planners and decision makers. The 
present status should be concise and complete; any de- 


cisions that have been firmly made, but not as yet im- 
plemented, should be “baked in” as part of the plan 
initiating point. As decisions are made, this part of a 
plan package will change; such decisions should be for- 
mally reflected and should be documented so that at 
any time that management decision making may occur 
(and this process could be almost continuous) an up-to- 
date document can be presented to insure fact-founded 
decision. 

A typical Present Status should include: 

1) A statement and exhibits to indicate present data- 
processing installations. 

2) An exhibit indicating the size, geographical loca- 
tion of these installations, along with an indication 
of communication media. 

3) The basic type of equipment employed and their 
cost or rental. 

4) An analysis of the number of people involved in 
data processing and the annual cost including 
fringe benefits. 

5) A statement and chart that indicates the current 
organizational placement of data-processing com- 
ponents throughout the corporation. 

6) A chart indicating functions and work areas that 
are being handled by data processing for each in- 
stallation within the corporation. 

7) A detailed statement indicating the history of a 
representative data-processing component ; how it 
grew, what jobs it is now performing, what basic 
costs would be incurred if automated techniques 
were not employed. 

8) An indication of the training and supervisory 
talents that present job incumbents possess. 

9) Examples of areas in which improper control and 
improper organization, lack of documentation, 
poor systems concepts, etc., are contributing to 
erratic and faulty data processing. 

Current planning progress should then be indicated 
to show positive staff and line, central and decentral- 
ized work effort that has resulted from the over-all 
planning efforts. This should include such progress as: 

1) Training programs initiated by corporate or ma- 
chine manufacturers to increase the scope of em- 
ployees’ work efforts through greater knowledge 
of new concepts or machines. 

2) Training programs to teach supervisors and ma- 
chine procedural planners the latest administra- 
tive, documentation, and control techniques. 

3) Installation progress of approved EDP equipment 
or of new or combined EAM installations. 

4) Major conceptual developments initiated at any 
EDP installation. 

5) Disclosure of preliminary detail plans that look 
promising (although in which sufficient staff work 
has not been accomplished to secure final deci- 
sion) . 



244 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


6) Development of community or employee com- 
munications designed to “sell” data processing. 

7) Progress made in report and problem solution 
areas entirely foreign to data processing at this 
time; using techniques such as simulation, or by 
tailoring mathematical models of problem areas, 
thereby establishing entirely different methods 
for better and more accurate management deci- 
sion. 

The next section of a planning package may be a re- 
view of the development of short-range programs, as 
they relate to a long-range program. This program de- 
velopment review should include: 

1) A concise statement of problem areas; personnel 
outside of the planning section should be asked to 
supplement and/or modify a list of these areas. 

2) A brief statement as to the proper remedial action 
that must be taken to correct existing problem 
areas. Again, the considered opinion of line and 
staff executives should be requested during the 
formulation of the short-range action plan. 

3) New problem areas currently not processed me- 
chanically or electronically; these should be de- 
fined and should be investigated in terms of possi- 
ble application and solution by use of different 
source information or processing techniques. 

4) A firm schedule against which periodic progress 
reports may be measured. 


5) Using the aforementioned four steps, a general 
statement recording 1-, 3-, and 5-year goals of in- 
formation handling should be spelled out. This 
should include a statement specifying the planning 
philosophy and the general direction in which 
the plan is approaching the stated objectives. 

In closing, EDP planning is a repetitive, dynamic 
process that, to be effective, is flexible and when inter- 
mediate decisions are made, should be re-created. It 
may be worthless if used as a pure academic exercise. A 
plan is considered and then may be rejected or partially 
or completely adopted. In any event, it has served a 
purpose. EDP planning should be tailored to the total 
enterprise, to top management individuals, and to the 
background of previous good or bad experiences with 
EAM or EDP applications. Planning recognizes that 
staff members do not have a patent on brains. Factual 
and idea contribution must be encouraged and de- 
veloped. It should not always result in an EDP applica- 
tion; in many cases, byproducts of EDP plans and feas- 
ibility studies are more valuable than the EDP results 
themselves. 

De-emphasizing hardware, it should place much more 
stress upon approach concepts and related facts. 

EDP planning should be kept practical and should 
engender continuing enthusiasm on the part of the plan- 
ning staff. It is not an end in itself. The only reward of 
good planning is its influence upon management, which 
results in the achievement of stated objectives. 


Dynamic Production Scheduling of Job-Shop 
Operations on the IBM 704 Data- 
Processing Equipment 

L. N. CAPLANf and V. L. SCHATZf 


F IRST, let us explain what we mean by job-shop 
operations. In this case we are talking about our 
Jet Engine Department in the General Electric 
Company, Aircraft Gas Turbine Division. The Jet En- 
gine Department is the step between the basic concept 
development and the production shop. Their manufac- 
turing facility is responsible for building engines and 
components and for testing these engines and compo- 
nents to prove practicability of design, serviceability, 
manufacturability, and reliability. The prototype en- 
gines are also within their manufacturing responsibility. 

f General Elec. Co., Business Systems Computations, Evendale, 
Ohio. 


From this, one can visualize the type of operation 
about which we are speaking. It is largely one where 
Engineering requests parts to be manufactured for in- 
stallation into an assembly which is to be tested. The 
important thing is that there are a large number of re- 
questors and relatively few items per request. Stated 
another way, this means no schedule of incoming work. 
Hence, the name, job shop. Fig. 1 represents the flow of 
operations just discussed. 

Now that we have seen what a job shop represents 
in our case, consider the size of our operation. There are 
approximately 9000 operations on order in the manu- 
facturing shop at any one time. These operations may 
represent two or three thousand requests with several 
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Fig. 1 — Jet Engine Department operations diagram. 


machine operations per request. This shop contains 50 
machine-tool areas. An area in this case represents ma- 
chines which would perform one type of function — for 
example, 5 lathes or 3 grinders or 15 inspectors. The 
cycle time for one job through the shop averages about 
22 days. 

At this point the problem becomes evident. We must 
be able to provide a schedule which will process work 
through the shop in the most efficient manner. We must 
work to a schedule that is set by the requestors’ due 
dates. We must minimize waiting time for the part re- 
quested, we must minimize farm-out to vendors while 
idle time exists in our own shop, and we must minimize 
the idle operator time in our own machine shop. Also, 
we would like maximum utilization of our expensive 
machine tools. The scheduling must be completed 
quickly; otherwise, it is obsolete by the time it is to be 
used. In addition to the size of the job to be done, other 
complications to scheduling include the necessity to 
make allowance for weekends, allowance for holidays, 
consideration for overtime authorizations for extra 
shifts, holidays and weekends, sequential machine op- 
erations, simultaneous completion of machine opera- 
tions on all parts going into an assembly, and operations 
being performed by outside vendors. It is difficult to 
develop the start and finish time for each job when all 
these facts are considered. 

A punched-card and tabulator system had been used 
with some degree of success in the past. The primary 
output of this system was a punched card provided to 
the shop foremen showing the job to be worked on and 
the starting date. In spite of the fact that this is a rela- 
tively small shop, the scheduling became nearly impos- 
sible on tab equipment. The scheduling was inefficient 
to the extent that we were having to farm out work and 
at the same time we had idle machine time in our own 
shop. The resultant loss ran into thousands of dollars 
monthly. Also, in one case we were scheduling for two 
months’ backlog of grinding work when actually less 
than two days’ work was available for grinding. 

This was more or less the climax which brought about 
consideration of the computer by the Production Sched- 
uling people. Initially they were discouraged because of 
some of the things they had been told. For example, a 
great portion of the machine work would be sorting of 
the 9000 work-information cards into priority sequence 
and then resorting into job-order sequence. The job num- 


ber is 19 digits in length. Therefore, to sort and resort the 
information deck would require passing 342 ,000 (9000 X 2 
X19) cards through a conventional sorter. This would 
take well over 10 hours and the schedule would be obso- 
lete before the sorting was completed. The opinion was 
that sorting of this type would be prohibitive in cost on 
a computer. A second opinion held that the great num- 
ber of exceptions would make this scheduling impracti- 
cal to program for a computer. Nevertheless, Produc- 
tion Scheduling brought the problem to the Computa- 
tions facility to see what could be done. After some 
study, we decided that the job could be accomplished on 
our 704. Painstaking programming would be required 
and new techniques or so-called “tricks” could be de- 
veloped to do the sorting in a practical time. The exist- 
ing card system was adapted to computer handling with 
a bare minimum of change. One man from Production 
Scheduling, working half-time, and a man from Com- 
putations, working full-time, completed and placed the 
program in operation in about 10 months. 

Let us consider the system and the computer’s role. 
When a work authorization is received by the machine 
shop, the scheduling group assigns a job number and 
priority, and breaks the order down into operations 
and estimated hours required for each operation. Cards 
are then made up for each operation. The card contains 
priority, job number, operation number, type of over- 
time allowed, men required, work area and alternate 
work area, a minimum starting date, estimated hours, 
part name, quantity, and some other information non- 
pertinent to the computer operation. The cards for new 
work are brought to Computations on a weekly basis 
along with cards on work in process. A date card to tell 
the machine when the scheduling begins heads the deck. 
Priority cards are placed ahead of the operation’s cards. 
(See Fig. 2.) Within Computations these cards, 9000 on 
the average, are placed on tape via card-to-tape con- 
verter. The tape and program are placed on the ma- 
chine. As the tape reads into the machine, the informa- 
tion is checked for possible errors, such as blanks in a 
numerical field, impossible data such as February 29th, 
non-leap year, and 31 days in 30-day months, or out-of- 
sequence job number. Some errors may be corrected. For 
example, if a 31st day is found in a 30-day month, the 
machine will make the day read as the first day of the 
next month. All errors are listed by job and operation 
number. If 50 errors are detected in the input deck, the 
operator is notified by an on-line comment to halt the 
program. The computer then prints the rest of the er- 
rors, if any, in the deck. As the information is being 
checked, it is also being transformed into binary words 
so that all scheduling information for the 9000 cards 
may be fitted into the 32,000-word memory at one time. 
This is still not possible so that job number (2 words) is 
placed on a tape sequentially. The technique for sorting 
this information is as follows. The memory was split into 
three parts. Storage 0-9000 will contain words with 
priority number and sequence number stored sequen- 
tially in the same word; 9001-18,000 contains words of 
scheduling information in sequential order; and 18,001- 
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Fig. 2 — IBM cards making up data deck. 


27,000 contains other words of scheduling information 
sequentially. (See Fig. 3.) 

The first 9000 words are sorted into priority sequence. 
By looking at the first word and adding 9000 and 18,000 
respectively, the computer may pick up the second and 
third words belonging to that operation. Sorting into 
order requires approximately two minutes. We write 
this on a tape. Similarly, we read back into spaces 
9001-18,000 and 18,001-27,000 the job number and pick 
it up in the same manner as above to merge with the 
tape containing our scheduling information. Finally, we 
have a tape containing the operations, including job 
number written in priority order. This tape feeds back 
into the computer and scheduling takes place. Assem- 
blies and sequential operations are taken care of by using 
buckets to hold dates until the required pieces are fin- 
ished. (See Fig. 4.) From this, we get a tape containing 
start dates in priority order and sequence number. This 
tape is read into memory and merged with our original 
input tape to place a starting date on each operation. 
(See Fig. 5.) The result is punched out as dispatch cards 
for the shop areas. Of course, while scheduling is going 
on, the computer is tabulating load per area per day. If 
maximum capacity is reached in any area per day, the 
tabulation and schedule move to the next day. 


0-9000 


Priority 


1 

1 

Sequence Number 


18 Bits 

1 

1 

1 

1 

1 

1 

18 Bits 


9000-18,000 



Fig. 3 — Illustration of memory division for sorting purposes. 


WORKLOAD/HOUES 



Fig. 4 — Example of machine-tool area loading logic. 


The printed work load (see Fig. 6) is sent to the Pro- 
duction Scheduling Office. The number at the far left 
denotes the machine tool areas, such as grinding, mill- 
ing, etc. The schedule starts as of the date printed at the 
top of the page. The work allotted to each area each day 
for the next 42 days is printed. Finally, the area’s maxi- 
mum capacity per day is printed on the far right. If they 
feel they can do better on work load, they may reshuffle 
priority and run again. (Our future plans are to have the 
computer determine optimum schedule.) The punched 
cards are sent to the machine tool areas. The workmen 
fill in the hours worked or finished on the card and the 
cards are returned to the scheduling office to be remade 
for next week’s run. That is, if a workman has com- 
pleted 15 hours of a 40-hour job, he marks 15 on the 
card. The scheduling office will remake the card with 
25 instead of 40 estimated hours and place it in the deck 
to be sent to Computations. (See Fig. 7.) 

Some interesting sidelights regarding the computer 
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Part Name 

Area 

Job Number 


Planned Hrs. 

Start Date 

Shaft Gear Spur 

202 

245 

5-6-00169 

000 

04 

7001 

0000 

10/23 

Shaft Gear Spur 

212 

255 

5-6-00169 

000 

04 

7001 

0017 

10/27 

Shaft Gear Spur 

201 

265 

5-6-00169 

000 

04 

7001 

0030 

10/27 

Shaft Gear Spur 

Mag 275 

5-6-00169 

000 

04 

7001 

0000 

10/28 

Shaft Gear Spur 

Ins 

285 

5-6-00169 

000 

04 

7001 

0000 

10/30 

Shaft Gear Spur 

212 

295 

5-6-00169 

000 

04 

7001 

0032 

11/03 

Shaft Gear Spur 

Ins 

305 

5-6-00169 

000 

04 

7001 

0000 

11/06 

Ring Spl Adapt 

274 

020 

5-6-40151 

000 

01 

0001 

0050 

10/01 

Ring Spl Adapt 

212 

030 

5-6-40151 

000 

01 

0001 

0010 

10/02 

Ring Spl Adapt 

Ins 

040 

5-6-40151 

000 

01 

0001 

0000 

10/17 

Comp Rotor 

268 

050 

5-7-80042 

803 

01 

0001 

0120 

10/02 

Comp Rotor 

212 

060 

5-7-80042 

803 

01 

0001 

0030 

10/03 

Comp Rotor 

Ins 

070 

5-7-80042 

803 

01 

0001 

0000 

10/07 

Blade Stg 3 

Zyg 

020 

5-7-80045 

803 

01 

0001 

0000 

09/29 

Blade Stg 3 

Ins 

030 

5-7-80045 

803 

01 

0001 

0000 

10/03 

Comp Blade Stg 4 

212 

010 

5-7-80045 

803 

01 

0002 

0204 

09/29 

Comp Blade Stg 4 

Zyg 

020 

5-7-80045 

803 

01 

0002 

0000 

09/30 

Comp Blade Stg 4 

Ins 

030 

5-7-80045 

803 

01 

0002 

0000 

10/06 

Comp Blade Stg 8 

Zyg 

020 

5-7-80045 

803 

01 

0003 

0000 

09/29 

Comp Blade Stg 8 

Ins 

030 

5-7-80045 

803 

01 

0003 

0000 

10/03 

Bid Comp Stg 16 

Ins 

030 

5-7-80045 

803 

01 

0004 

0000 

10/03 

Bid Comp Stg 9 

Zyg 

020 

5-7-80045 

803 

01 

0005 

0000 

09/29 

Bid Comp Stg 9 

Ins 

030 

5-7-80045 

803 

01 

0005 

0000 

10/03 

C/R Blade Stg 10 

212 

010 

5-7-80045 

803 

01 

0006 

0495 

09/29 

C/R Blade Stg 10 

Zyg 

020 

5-7-80045 

803 

01 

0006 

0000 

10/03 

C/R Blade Stg 10 

Ins 

030 

5-7-80045 

803 

01 

0006 

0000 

10/09 

C/R Blade Stg 1 

Ins 

030 

5-7-80045 

803 

01 

0007 

0000 

09/29 

C/R Blade Stg 1 

Zyg 

120 

5-7-80045 

803 

01 

0007 

0000 

10/01 

C/R Blade Stg 

212 

010 

5-7-80045 

803 

01 

0008 

0609 

09/29 

C/R Blade Stg 

Zyg 

020 

5-7-80045 

803 

01 

0008 

0000 

10/06 

C/R Blade Stg 

Ins 

030 

5-7-80045 

803 

01 

0008 

0000 

10/10 

C/R Blade Stg 

Zyg 

020 

5-7-80045 

803 

01 

0009 

0000 

09/30 

C/R Blade Stg 

Ins 

030 

5-7-80045 

803 

01 

0009 

0000 

10/06 

C/R Blade Stg 12 

Zyg 

020 

5-7-80045 

803 

01 

0010 

0000 

09/30 

C/R Blade Stg 12 

Ins 

030 

5-7-80045 

803 

01 

0010 

0000 

10/06 

C/R Blade 

Zyg 

020 

5-7-80045 

803 

01 

0011 

0000 

09/30 

C/R Blade 

Ins 

030 

5-7-80045 

803 

01 

0011 

0000 

10/06 

C/Disc Stg 5 

212 

020 

5-7-80045 

806 

01 

0001 

0050 

09/29 

C/Disc Stg 5 

268 

030 

5-7-80045 

806 

01 

0001 

0100 

10/07 

C/Disc Stg 5 

212 

040 

5-7-80045 

806 

01 

0001 

0025 

10/09 

C/Disc Stg 5 

Ins 

050 

5-7-80045 

806 

01 

0001 

0000 

10/14 

C/Disc Stg 6 

212 

020 

5-7-80045 

806 

01 

0002 

0050 

09/29 

C/Disc Stg 6 

268 

030 

5-7-80045 

806 

01 

0002 

0100 

10/09 

C/Disc Stg 6 

212 

040 

5-7-80045 

806 

01 

0002 

0025 

10/13 

C/Disc Stg 6 

Ins 

050 

5-7-80045 

806 

01 

0002 

0000 

10/15 

C/Disc Stg 7 

212 

020 

5-7-80045 

806 

01 

0003 

0050 

09/29 

C/Disc Stg 7 

268 

030 

5-7-80045 

806 

01 

0003 

0100 

10/10 

C/Disc Stg 7 

212 

040 

5-7-80045 

806 

01 

0003 

0025 

10/14 

C/Disc Stg 7 

Ins 

050 

5-7-80045 

806 

01 

0003 

0000 

10/16 

Stub Shaft 

Ovn 

010 

5-7-80045 

807 

01 

0001 

0000 

09/29 

Stub Shaft 

263 

020 

5-7-80045 

807 

01 

0001 

0040 

10/03 

Stub Shaft 

Ins 

030 

5-7-80045 

807 

01 

0001 

0000 

10/07 / 

Comp Disc Stg 

Ins 

050 

5-^-80045 

809 

01 

0001 

0000 

10/09 


Fig. 5 — Output for Scheduling Office. 


Area 

201 

Day 

48.0 

1 to 42 
48.0 

48.0 

48.0 

48.0 

0.0 

Production Scheduling 

Day by Date Load 09/22/58 

0.0 48.0 48.0 48.0 48.0 48.0 

0.0 

0.0 

Capacity/day 

48.0 hrs. 

— 

48.0 

48.0 

48.0 

48.0 

24.0 

0.0 

0.0 

13.1 

0.0 

2.4 

1.8 

0.9 

0.0 

0.0 

— 

— 

0.6 

0.8 

0.6 

3.7 

3.5 

0.0 

0.0 

0.0 

2.5 

2.0 

1.0 

0.0 

0.0 

0.0 


— 

203 

32.0 

32.0 

32.0 

32.0 

31.0 

0.0 

0.0 

13.5 

13.5 

30.5 

30.5 

0.0 

0.0 

0.0 

Capacity/ day 

32.0 hrs. 

— 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

— 

— 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 


— 

206 

32.0 

14.5 

23.5 

15.5 

0.0 

0.0 

0.0 

6.0 

16.5 

20.0 

14.9 

12.1 

0.0 

0.0 

Capacity/day 

32 . 0 hrs. 

— 

16.0 

13.0 

10.0 

8.0 

0.0 

0.0 

0.0 

0.0 

12.0 

8.0 

0.0 

0.0 

0.0 

0.0 

— 

— 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 


— 

207 

32.0 

32.0 

32.0 

32.0 

32.0 

0.0 

0.0 

32.0 

32.0 

25.0 

2.5 

0.0 

0.0 

0.0 

Capacity/day 

32.0 hrs. 

— 

0.0 

17.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

— 

— 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 


— 

208 

32.0 

32.0 

32.0 

32.0 

32.0 

0.0 

0.0 

17.0 

3.5 

5.0 

0.0 

18.0 

0.0 

0.0 

Capacity/day 

32.0 hrs. 

— 

0.0 

12.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

— 

— 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 


— 

209 

16.0 

15.5 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

2.5 

0.0 

6.0 

12.0 

0.0 

0.0 

Capacity/day 

16.0 hrs. 


Fig. 6 — Listing of punched-card output. 





are the following. Our 32K memory is completely filled 
twice during running. Six tape units are required. The 
computer running time is 30 minutes. In addition, two 
hours’ handling time is required to prepare the input. 

At one time an experimental task force tried to do the 
complete scheduling job by hand. By the end of four 
days of round-the-clock work, they threw up their 
hands. The machine system has now been in operation 
several months with a resultant 4 per cent drop in idle 
labor. Savings the first year, above the cost of the pro- 
gram, are estimated at a minimum of $31,000, but are 
probably much greater. 

In summary, the advantages of the system are 

1) realistic, up-to-date and on-time schedules, 


2) better work loading for minimum idle time of men 
and machines, 

3) compact and easy method of placing work in areas, 

4) use of the program to predict results of crash pro- 
grams on delivery schedules and dates. 

An added advantage not foreseen is that a tool dis- 
patcher now places tools at all machines prior to the 
workmen starting on the job, instead of having the 
workmen obtain their own tooling after receiving the 
job. This is possible since we know what type of work 
will be at each machine tool and on what day. Delays 
in schedule due to tool shortages no longer occur. 

As for future plans, we are making a study of schedul- 
ing the entire integrated operation as seen in Fig. 1. 
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Numerical Methods for High-Speed Computers — 

A Survey* 

GEORGE E. FORSYTHEf 


Classification of Problems 

N UMERICAL analysis is the science and art of 
using digital computing machines to carry out 
scientific computations, excluding pure data 
processing. Because of the recent changes in computers, 
the field is dominated today by the problems of using 
the large stored-program digital computers. Analog 
computers, though important both for their output and 
as a source of methods (too seldom exploited) for digital 
computers, are not considered here. It is possible to give 
a rough mathematical classification of the types of com- 
puting problems ordinarily met. The following classifica- 
tion is adapted from Forsythe [4]. 

A pproximation 

1) Evaluate functions of one or more variables. Inte- 
grate, differentiate, and interpolate them. Sum a series. 

2) Approximate functions of one or more variables 
by simpler functions in the sense of least squares, least 
deviation, or other norms. 

3) Test empirical data for blunders and fill in missing 
data. Smooth such data, and fit them with typical 
curves. Integrate and differentiate them. Analyze their 
spectra. 

Algebra and Number Theory 

4) Solve a number of simultaneous algebraic or tran- 
scendental equations, 

a) Linear. 

b) Nonlinear, including algebraic eigenvalue prob- 
lems and polynomial equations. 

5) Solve systems of linear or nonlinear inequalities, 
a) Programming problems, 
b) Problems from game theory. 

6) Combinatorial problems (dealing with functions 
of permutations). 

7) Problems from number theory. 

Analysis and Functional Equations 

8) Find the maximum of a function of one or more 
variables. 

9) Solve initial-value problems for one or more 
ordinary or partial differential equations. 

10) Solve boundary- value problems for one or more 
ordinary or partial differential equations. 

11) Solve problems for other functional equations, 
including, for example, 

* The writing of this exposition was sponsored by the Office of 
Naval Res., under Contract Nonr-225(37), NR 044 211. 
f Stanford University, Stanford, Calif. 


a) Differential equations with retarded arguments. 

b) Difference-differential equations. 

c) Conformal mapping problems. 

Simulation 

12) Simulation of random noise — for example, with a 
prescribed power spectrum. 

13) Simulation of physical systems as an alternative 
to setting up equations for them, e.g., traffic flow, or 
animal nervous systems. 

The Wide Scope of Numerical Analysis 

In treating any of the above classes of problems, there 
is a very wide class of activities which comes under the 
heading “numerical analysis,” and an additional group 
of almost inseparable activities called “mathematical 
analysis” at the origin of the problem, and “program- 
ming” or “coding” on the machine end. Let us briefly re- 
view these. 

Problem Formulation 

The formulation of the mathematical problem to be 
solved is applied mathematical analysis, and not numer- 
ical analysis. However, the computational feasibility of 
the problem is a very important factor in the selection 
of a mathematical problem by which to model a given 
physical problem. 

Replacement by an Algebraic Problem 

Many of the mathematical problems of problem 
formulation A are not algebraic. Basically, digital com- 
puters can only add, subtract, multiply, and divide. 
Hence, to be tractable by a computer a problem must be 
algebraic— dealing with only a finite number of vari- 
ables. The reduction of a transcendental problem to a 
related algebraic problem (or sequence of algebraic 
problems) may be called “discretization,” and the cor- 
responding error, the “discretization error.” The study 
of discretization is an important part of numerical 
analysis. 

Design of a Theoretical Algorithm 

Having an algebraic problem to deal with, numerical 
analysts design algorithms with which to solve it. At 
this stage it is expedient and customary to think of ideal 
algorithms in which exact arithmetic operations are 
carried out with real numbers. Some algorithms are 
“direct,” terminating with the answer to the algebraic 
problem in a finite number of steps. Others are “itera- 
tive,” and attain the answer only as the limit of a se- 
quence of steps. 
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Convergence and Errors of Iterative Algorithms 

Numerical analysts study the convergence and other 
asymptotic properties of iterative algorithms. The fact 
of convergence is almost essential to their use, while the 
speed of convergence is economically important. Knowl- 
edge of the asymptotic behavior assists one in accelerat- 
ing the convergence, which is almost always too slow. 
Finally, at a higher level of sophistication, one ought to 
have at any stage of the iteration a means of bounding 
the difference between the current iterate and the exact 
solution of the algebraic problem. 

Digitalization and Round-Off Error 

We know that digital computers do not deal with real 
numbers, but only with rational numbers. In practice, 
computers usually use only special kinds of rational 
numbers — short terminating fractions to base 2 or 10, 
called “digital numbers.” The arithmetic operations 
heretofore discussed are replaced by various pseudo- 
operations. This introduces round-off errors and raises 
all kinds of difficult questions about the algorithm and 
its utility. These, too, are treated by numerical anal- 
ysis, and represent a major point of contact with ac- 
tual computers. 

Programming and Coding 

Actually getting problems on a digital computer, the 
goal of programming and coding, is so closely related to 
numerical analysis that it is difficult to separate them. 
The selection of an algorithm is bound up with the avail- 
able means for approximating it digitally on a com- 
puter. For many problems, like the solution of Dirichlet’s 
problem over a complicated region, the machine de- 
termination of difference equations (hand determina- 
tion would be unthinkable for large problems) requires 
a code far more difficult than the machine solution of 
the equations. 

The wide range of activity naturally forces practicing 
numerical analysts to specialize. I think it is vital that 
such specialization be along “vertical” lines, i.e., that 
each analyst deal with problems from their formulation 
right through their coding and operation, confining 
himself, if necessary, to a few problems. In this way the 
feedback between machine peculiarities and problem 
formulation is most apt to be effective. This feedback 
should lead to the most rapid evolution of mathematical 
methods for using the new computers. As an immediate 
benefit, such a plan of operation should be the truest 
safeguard against major blunders, for such blunders 
arise most easily in areas of misunderstanding between 
different responsible persons. 

Unfortunately, the organization of mathematical 
groups in many companies is in just the opposite direc- 
tion. Engineering groups and mathematical analysis 
groups emerge, as well as programming groups and 
machine operating groups, and there is resistance to re- 
laxed communication among them. It would be a great 


contribution to the use of computers — and to our entire 
technology — if some organizational genius could find a 
method of keeping problems in the hands of very small 
responsible groups from their inception to their solu- 
tion. 

Current Status of the Mathematical Areas 

It is manifestly impossible to survey here the current 
status of all areas of numerical analysis. Let me merely 
point to a few sources of new research in the fields, fol- 
lowing the outline of the first section. Many exceedingly 
important new contributions will necessarily fail to be 
mentioned here. 

Approximation 

A major conference on approximation was held at the 
Army Mathematical Center in Madison, Wis., in April, 
1958. The proceedings of that conference, to be pub- 
lished, will be an excellent synopsis of current thought 
on approximating functions. Problems of numerical 
integration, differentiation, etc., can be given a formula- 
tion in terms of finding nearest vectors in such function 
spaces as Hilbert space. (See Sard [l 6 ] .) While more 
conventional approaches, based largely on polynomial 
interpolation, are probably an adequate basis for deal- 
ing with functions of one variable, it is probable that 
approximation problems for functions of two or more 
variables will be solved effectively only in terms of func- 
tion spaces. And it was quite clear at the Madison con- 
ference that the great need now is for suitable methods 
for integrating, interpolating, and approximating func- 
tions of several variables. 

The problem of estimating the power spectrum of a 
random process from a sample time function has become 
exceedingly important in modern engineering. The 
problem is one of statistical estimation, and progress is 
coming largely from statisticians like Tukey [13 ]. 

One area in approximation theory, that of minimax 
approximation, is receiving considerable attention. It 
has been known for years that polynomial and rational 
functions of least deviation from fix) over an interval 
are characterized by residuals with a certain equal- 
alternation property. Since a computer can generate 
any rational function of x, and only rational functions 
of x (we ignore problems of digitalization) , it is natural 
to seek rational approximations to the common tran- 
scendental functions of analysis. Hastings and his col- 
laborators [6 ] have published a book of useful approxi- 
mations obtained by hand-tailoring methods. But it is 
widely agreed that digital computers can and should 
generate such approximations. Work of Remez [14] led 
to an algorithm by Novodvorskii and Pinsker [l 1 ] for 
this purpose. The algorithm has been improved and 
coded at various computer centers, but it is not yet clear 
that a best procedure has been found. The problem is 
really one of linear programming with an infinite num- 
ber of conditions. 
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While it is an attractive and interesting numerical 
analysis problem, for example, to find the rational func- 
tion of degree 10 which best approximates cos x over an 
interval, this may not in fact be the most appropriate 
approximation to cos x in practice. It may be that a 
piecewise quadratic approximation would be faster to 
use, only slightly more bulky to store, and easier to de- 
termine. Professor C. B. Tompkins has mentioned that 
such an approximation is very easily tailored by an 
automatic computer to fit detailed specifications. Thus 
we must always ask whether numerical analysts are in 
fact applying their talents to solve the right problems. 

Algebra 

Problems of linear algebra and the solution of poly- 
nomial equations have been studied extensively during 
the present decade. In his 1958 lectures in Ann Arbor, 
Wilkinson [19] greatly increased our understanding of 
several common methods for these problems. His forth- 
coming book will contain the same material and more, 
while a little of the material is already available [2]. 
Wilkinson’s most important basic contribution seemed 
to be a satisfactory and realistic analysis of the round-off 
error in Crout’s method for Gaussian elimination. The 
importance of “searching for a pivot” was demonstrated 
conclusively. 

Wilkinson discusses the calculation of the eigenvec- 
tors of a tridiagonal matrix, a necessary part of either 
the Givens or Lanczos methods for solving the complete 
eigenvalue-eigenvector problem for finite matrices [20 ]. 
He also gave the following startling example of the in- 
stability of the roots of a polynomial. Let 

f{x) = (x + 1)0 + 2) • • • O + 20) 

= x 20 + 210x™ + • • • , 

with zeros —1, —2, • • • , —20. Suppose just one co- 
efficient of f{x) is slightly perturbed, say because of 
rounding, so that instead of f(x) one deals with g(x) 
=f(x)+2~ 23 x 19 . Then Wilkinson has found that the 
zeros of g include numbers near —20.846, and —13.99 
+ 2.5 i. In face of this evidence, no one should ever 
again blindly ask any automatic routine to find the 
zeros of a polynomial of importance! 

Householder [7 ] has reported a new algorithm which 
might be an improvement on Givens’ very excellent 
routine for getting eigenvalues of symmetric matrices. 
For unsymmetric matrices the eigenvalue problem re- 
mains one for which there are many routines, with none 
of them outstanding enough to become standard. 
Ostrowski [12 ] has published one of the first papers 
bounding the variation in the eigenvalues of an unsym- 
metric matrix, due to perturbations in its elements. 
This has very important consequences for round-off 
error. It will be important to improve Ostrowski’s re- 
sults for various special cases where root multiplicity can 
be guaranteed to be less than the order of the matrix. 


I shall not attempt to report on the progress in dealing 
with inequalities, combinatorial problems, or number 
theory. 

Functional Equations 

The status of the numerical solution of ordinary dif- 
ferential equations is being reported in a forthcoming 
book by Henrici, starting from the points of view of 
Rutishauser, Dahlquist, and others. The well-known 
methods of Milne [9] have generally proved unstable on 
automatic computers, although recent modifications 
may save them [10 ]. 

A recent book by Richtmyer [15] treats difference 
methods for marching problems of partial differential 
equations. A forthcoming book by Forsythe and Wasow 
will deal with difference methods for linear partial dif- 
ferential equations of second order, including both 
marching and jury problems. The concept of stability 
will be quantified somewhat for marching problems. 
The work of Young, and Peaceman-Rachford and suc- 
cessors, will be recorded. Work by Kahan and others on 
overrelaxation without Young’s property A will be men- 
tioned. 

As indicated earlier, a large practical problem in put- 
ting Dirichlet’s problem on a computer is to generate the 
difference equations. A very substantial project, called 
SPADE, is operating under SHARE to automate the 
generation and solution of difference equations. Such 
work is as important as it is difficult, and needs solid 
sponsorship. 

The work of Gerschgorin is vital in bounding the dis- 
cretization error due to replacing an elliptic boundary- 
value problem by difference equations. The Gersch- 
gorin result has been extended by Wasow [18 ]. Recent 
Russian work is extending the results to cases where the 
unknown solution does not have bounded third deriva- 
tives. (See Volkov [17].) 

The Gerschgorin theory has not been developed to 
deal with internal interfaces, and it would be highly de- 
sirable to make the extension. There are many ques- 
tions open for investigation in connection with graded 
nets, regions with corners, and so on. 

Simulation 

About simulation there is only one topic I wish to 
mention. An increasingly important area in modern 
engineering is that of random noise. The simulation of 
random noise is important in many ways, and methods 
for analog computers are discussed, for example, by 
Laning and Battin [8], The author holds that a well- 
designed digital computer can do anything an analog 
computer can, and better, provided that the coders 
know what to code. It seems to me time that someone 
investigated and wrote about the simulation on a digital 
computer of random noise with a prescribed power 
spectrum or autocorrelation function. 
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Treatment of Error 

Before the advent of computers one thought about 
discretization errors, and these are still important. In 
dealing with desk computations one had to worry about 
arithmetic blunders, but these are comparatively rare in 
automatic computation. But the characteristic error in 
much automatic computation is an error due to the 
digitalization of the numbers in a computer, with re- 
sultant approximation of the arithmetic operations, to- 
gether with the cumulative effects of these errors 
through numerous subsequent operations. One now 
finds several points of view emerging with regard to 
these errors. 

1) Let me illustrate one new point of view in the con- 
text of the solution of a linear algebraic system Ax = b, 
although it has far wider applications. Here A is a 
given nonsingular matrix of digital elements, while b is 
a given vector of digital components. Let h = A~ x b be 
the true solution of the system. Suppose one has found a 
digital vector x which approximately solves the system. 
How does one measure the error in x? 

The conventional concept of error is the vector x — h, 
or some number ||x — h\\ which measures its smallness. 
The vector x — h answers the question, how wrong is x 
as a solution of the problem with the given data A, b? 
It is comparatively difficult to estimate |[x — h\\, since it 
requires knowledge of the size of A~ x . 

Givens [5] and Wilkinson [19] are asking a different 
question in formulating the concept of the error in solv- 
ing Ax — b. Namely, how necessary is it to alter the given 
data A, b to new values A i, bi, in order that the approx- 
imate solution x correctly solve the altered problem 
Aix = bR They consider the matrix A— A\ and the 
vector b — bi as a definition of the error in x. Again, they 
might adopt some number of the form e= {HA— Hi|| 2 
+ |(& — &i|| 2 } 1/2 as a measure of the smallness of A — A\ 
and b — bi. 

Of course A\ and bi are not uniquely determined by 
x. With suitably chosen norm functions one could force 
uniqueness by demanding that Ai, b\ be closest to A, b, 
in the sense of minimizing the e defined above; let mine 
denote that minimum value. However, it would prob- 
ably be difficult to compute min e or the minimizing 
quantities A i, bi, and all that is really needed is a rea- 
sonable upper bound for min e. The real power of the 
Givens-Wilkinson suggestion is that it is easy to bound 
min e in solving linear systems by a number of methods. 
I shall not be able to go into this, except to mention the 
use of the residual b — Ax. 

One simple way to bound min e has been used for a 
long time. Let bi = Ax, and let Ai = A. Then Aix = bi, 
and min e<||6 — &i|| =||& — Ax\\. Here one has concen- 
trated the necessary alteration of the data in the vector 
b. The vector b — Ax is easy to compute and is often al- 
ready available as a byproduct of a solution of Ax = b. 
The idea of measuring the error in x by the size of the 


residual b—Ax is not a new one, of course, but it has 
usually been considered only as an expedient substitute 
for the “real error” x — h. What is new in recent thinking 
is to consider that such a number as ||&— Hx|| (or the 
more general e) may often be a fair measure of the error 
in solving Ax = b. Of course, the decision as to what is an 
appropriate measure of the error in solving a system can 
only be decided after an examination of the physical 
problem underlying the computation. 

2) A second major development is the advent of 
codes which precisely bound the error of a calculation. 
Perhaps the most promising are the range arithmetic 
codes prepared under the supervision of Ramon Moore 
at the Lockheed Missile Systems Division, Palo Alto, 
Calif. In these codes, the operands are the range num- 
bers of Dwyer [3], i.e., real intervals. The answer to a 
range multiplication, for example, is the smallest digital 
interval which includes all products of pairs of numbers 
chosen from the intervals of both factors. 

Range arithmetic seems to be the most efficient form 
of error computation by a machine, and machine compu- 
tation seems far easier and more efficient than human 
bounding of errors. It is to be hoped that range arith- 
metic will find its way into all algebraic translators, as 
an optional form of arithmetic. 

3) There have been suggestions by Wilkinson, Carr, 
and others, that problems be computed several times — 
for example, one each with eight, nine, ten, eleven, 
twelve, and thirteen decimals. From a comparison of 
the answers, one should be able to guess the round-off 
errors quite satisfactorily in some problems. Such ability 
to do arithmetic with variable word length is not easily 
available on most machines, but should be made 
available. 

4) Givens has emphasized the importance of having 
programs which print out guaranteed bounds for 
answers to problems. For example, if a polynomial is 
input (perhaps by means of its coefficients as range num- 
bers), an ideal routine might output in effect several 
rectangles in the complex plane, together with the 
number of zeros in each rectangle. Such a routine should 
be without the slightest mathematical error; its asser- 
tions should be known to be correct for a specified class 
of polynomials. And, if the input polynomial turns out 
not to be in that class, that fact should be output. 

One knows in principle how to write such routines for 
various types of problems. What is lacking is definitive 
experience with them, for very few have been actually 
written. With polynomial zero-finders, it is certain that 
acceptably small rectangles can be found, by use of 
multiple-precision arithmetic, if necessary. But for 
ordinary differential equations, for example, it may be 
that no realistic solution bounds could ordinarily be 
found, if they must be rigorously correct. A good way to 
settle this question is to accumulate much experience 
with well-written codes taking full advantage of the 
speed of current machines. 
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Free Experimentation and Teamwork of 
Man and Machine 

It is safe to say that we are all using automatic digital 
computing machines very badly. It is characteristic of 
humans to use new tools as though they were extensions 
of old tools, and only to devise appropriate new meth- 
ods very slowly. It would be most desirable to speed up 
our process of accommodation to the new machines, 
since otherwise one wonders if numerical analysts will 
ever catch up with computer engineers ! 

I suppose that the surest road to progress is to en- 
courage imaginative experimentation by those close to 
machines, and especially to attract imaginative people 
to computer laboratories. I feel it would help if machine 
administrators would allow — yes, and encourage — 
their most imaginative numerical analysts to play per- 
sonally with the newest machines as much as possible. 

In the early days of automatic computation, research 
workers with important computing problems, physical 
or mathematical, would always be present when their 
computations were being carried out on automatic com- 
puters. They would help trouble-shoot the code, and 
use the preliminary results to suggest new cases to try 
and parameters to vary. The same method of operation 
is customary now with analog computation. 

However, it was discovered that inquisitive research 
workers would waste machine time while they pondered 
the problem, and probably they sometimes got in the 
way of the machine operators. As a result, in many 
companies programmers and research workers are ef- 
fectively discouraged from attending machine runs. 
Elaborate automonitoring and other efficient routines 
enable the machine to operate automatically for hours 
at a time, running codes which are preassembled on 
tapes. As a result, the apparent efficiency of a computing 
laboratory increases, when measured in good operating 
time per month. However, the real efficiency may well 
be found to decrease, when measured in problems 
solved per month. The reason is that the machine is be- 
ing deprived of its most valuable component — an in- 
telligent human who knows the problem. Knowing he 
will not be present at the console, the coder tries to 
cover every eventuality, and sometimes asks for ten 
times as much computing as would be necessary if he 
could be present. Is this efficiency? 

There are sound economic reasons for insisting on 
reasonable efficiency in the machine room, and I do not 
advocate wasteful practices. But the gain from reason- 
able efficiency should include the gain from reasonable 
use of man as the most important ally of the machine. I 
feel that if a scientist wishes to work with the machine, 
it is up to the administrators to design methods whereby 
this collaboration can be efficiently and effectively car- 
ried out. Machine designers should also bear this in 
mind, and provide enough visual monitors and enough 
conveniently adjustable console registers, so that man 


and machine may intercommunicate quickly and effec- 
tively. In this way man can return to the automatic 
digital computing loop in the effective way he is present 
in other types of computation. 

It must be admitted that many administrators dis- 
agree with the last paragraph. They feel that an auto- 
matic digital computing loop can operate quite effec- 
tively at the slower rate which occurs with unattended 
runs, when the analyst ponders the output overnight, 
returning next day with his new input. And the collabo- 
ration of man and machine may sometimes be more ef- 
fective because of the extra time spent by the analyst in 
pondering his next move . 1 

I am sure that unattended runs are often adequate, 
and especially often for the more easily analyzed and 
better understood classes of problems. But it is char- 
acteristic of difficult problems that they are only im- 
perfectly understood, so that the analyst may be in 
grave doubt about the choice of computing method and 
about various other matters, and even about where 
trouble will occur. But even at high speed it is often 
possible to read orders of magnitude from console 
registers and thus monitor a computation. For some 
problems an analyst in attendance can come prepared 
with various cards, for example, and can interpose 
changed values or alternate codes very early in a long 
computation, at a large net saving of computer time 
over just letting the machine run unattended. Can’t we 
agree to give a scientist a few seconds to make such ad- 
justments during a run? This is not to ask for many min- 
utes for pondering a surprising turn of events — that 
would be very wasteful by any criterion. 

I suppose the decision as to attending a run should be 
the decision of how a creative but responsible man feels 
he can best operate. A creative scientist is a rare person, 
and in his research should be permitted wide latitude in 
choosing his hours, location, and methods— including 
his technique of collaboration with a machine. In funda- 
mental terms, computers can be manufactured in great 
numbers, and there are many persons capable of ad- 
ministering their operation. The really critical shortage 
remains that of creative scientists and engineers who 
can make effective use of their tools. Hence, at least on 
any problem of importance, the time of a scientist is 
really more valuable than that of a machine, and all ob- 
stacles should be cleared away from his use of a machine 
in the way he feels will solve his problem most effec- 
tively. No one should ever compare a creative man with 
a machine on the basis of their dollar costs per hour! 

Literature on Numerical Analysis 

In a field growing as fast as numerical analysis, it is 
difficult even to learn the names of important new books 
and journals, let alone read them. I shall mention here 

1 The author is indebted to Dr. Walter F. Bauer for an exchange 
of ideas on this matter. 
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the names of a few journals which have a fair number of 
articles or abstracts on the numerical analysis appropri- 
ate to high-speed digital computers. 

Journals 

Chiffres (France), The Computer Journal (England), 
Communications of the Association for Computing Ma- 
chinery , Journal of the Association for Computing Ma- 
chinery, Journal of Research of the National Bureau of 
Standards, Journal of the Society for Industrial and Ap- 
plied Mathematics, Mathematical Reviews, Mathematical 
Tables and Other Aids to Computation, Zeitschrift f4ir 
numerische Mathematik (West Germany), Proceedings 
of the Cambridge Philosophical Society, Quarterly of Ap- 
plied Mathematics, Quarterly Journal of Mechanics and 
Applied Mathematics (England), Referativnyl Zhurnal — 
Matematika (Soviet Union), Review of the Society for 
Industrial and Applied Mathematics, VychisliteVnaia 
Matematika (Soviet Union), Zeitschrift fur angewandte 
Mathematik und Mechanik (East Germany), Zeitschrift 
fur angewandte Mathematik und Physik (Switzerland), 
Zentralblatt fur Mathematik und ihre Grenzgebiete. 

Any library associated with a digital computer center 
should have current subscriptions of all these journals, 
together with as complete a back file as possible. 

Books 

Books on numerical analysis which take real account 
of automatic computation have been slow to appear, 
perhaps because the field changes so rapidly. I should 
like merely to call your attention to two books which 
are too recent to be well known. Complete citations are 
in the bibliography below. The first is “Modern Com- 
puting Methods” [2], written anonymously by four 
mathematicians in the National Physical Laboratories, 
near London. It is thin but full of good material by per- 
sons intimately involved in computation, both before 
and after the computer revolution. The bibliography of 
128 titles is completely annotated. 

The second is a general survey [l] of machines by 
Alt, including coding and problem analysis. There is a 
12-page bibliography, implicitly annotated by cross- 
references to the text. 
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More Accurate Linear Least Squares 

RICHARD E. VON HOLDT| 


The Linear Least-Squares Problem 

ET A be a given matrix of m rows and n columns, 
( m>n ), so that AW = 0 implies IT— 0 (i.e., the 
^ column vectors of A constitute a linearly inde- 
pendent set), and let Fbe a given m-dimensional vector. 
We seek an ^-dimensional vector X, so that | R(X ) | 2 is 
the minimum value of | R(X) | 2 where 

R(X) = AX - Y. (1) 

A Geometric Derivation of the Classical 
Solution to the Linear Least-Squares 
Problem 

Let S be that subspace of m-dimensional Euclidean 
space which is spanned by the column vectors of A. 
Then for arbitrary X, AX is a vector in S, and R(X) is a 
vector with initial point at the terminal point of Y and 
terminal point in S'. 

Let AX be the orthogonal projection of F onto S. 
Then 

R(X) = AX - Y, (2) 

is orthogonal to S, or 

A T R(X) = 0. (3) 

For arbitrary X, we have 

R(X) = [R(X) - *(:?)] + R(X) 

= A (X - X) + RiX). (4) 

From (3) and (4), for arbitrary X, we have 

| R(X) | 2 = | A(X - X) ! 2 + | R(X) | 2 > | R(X) | 2 . 

Thus | R(X) | 2 is the minimum value of | R(X) | 2 for arbi- 
trary X. Substituting (2) into (3), X must satisfy the 
relation : 

A T AX = A T Y. (5) 

From the hypothesis on A, we have 

A T AW = 0-^W T A T AW s | AW\ 2 = 0-+W = 0. (6) 

Hence A T A is a nonsingular nXn matrix and (5) has a 
unique solution. 

The Solution of (5) by Diagonal Pivots 

Let Mk , (k = l, 2, • • • , n), be the matrix obtained 
from A by deleting all but the first k columns of A. Then 
the columns of Mk form a linearly independent set and 
by the argument of (6), MJMk is a nonsingular matrix, 
(£ = 1, 2, • • • , n). 

f Lawrence Rad. Lab., University of California, Livermore, 
Calif. 


Let Pi be the determinant of MJM\ and Pk be the 
determinant of MjJMk divided by the determinant of 
Mk-i T Mk-i, (k = 2, 3, • • • , n). Then 

Pus* 0, (k = 1, 2, • • • , n). (7) 

Using the diagonal elements, in increasing order, as 
pivots, and combining proper multiples of each row into 
all following rows to produce zeros below the pivot ele- 
ments in the column containing the pivots in (5) does 
not change the value of any of the minors of A T A 
formed by deleting all but the first k rows and all but 
the first k columns of A T A. Thus this process of replac- 
ing (5) by an equivalent upper-triangular system of 
equations yields the successive nonzero pivots: 

Pi, P‘2.1 ’ • • j Pn- (8) 

The above described process is equivalent to premul- 
tiplying both sides of (5) by a lower-triangular matrix 
L, with unit diagonal elements, yielding 

LA T AX = LA T Y (9) 

and this upper-triangular system is solved by back sub- 
stitution. 

The Solution of (5) by Orthogonalization 

Using the columns of A, in increasing order, as pivot 
columns, and combining proper multiples of each pivot 
column into all following columns so that the resulting 
columns are orthogonal to the pivot column, replaces the 
columns of A by an orthogonal basis for S. Let the 
matrix which results be denoted by B. Then 

B — AU (10) 

where U is an upper-triangular matrix with unit-diag- 

onal elements, and furthermore 

B T B = D, (11) 

where D is a diagonal matrix. 

Premultiplying (5) by £/ r and replacing A by UU~ l £, 
we have 

DU~ l X = B T Y (12) 

and from this, 

X = UD~ l B T Y. (13) 

Comparison of Methods 

Since both LA T A and L T are upper- triangular ma- 
trices, their product, LA T AL T , is also upper triangular, 
besides being symmetric, and is therefore a diagonal 
matrix. Thus AL T is a matrix of mutually orthogonal 
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columns and since L T is upper-triangular with unit- 


diagonal elements, 

L T = U. (14) 

Again, since L T is upper-triangular with unit-diag- 
onal elements, the diagonal elements of LA T A and D 
— ijtjITji U — LA T AL T are identical, or 

\B k \ 2 = P k , (k = 1, 2, • • • , n). (15) 

Let Ak and B k be the kth. columns of A and B, respec- 

tively, and let e k be a scala, defined by 

| B k | =€*1-4*1, (k = 1, 2, • • • , n). (16) 

Since B k is a projection of A k , we have 

0 < €* < 1, (k = 1, 2, • • • , n), (17) 

and €*; is a measure of the figure loss encountered in con- 
structing B k from A k by orthogonalization, and there is 
no further figure loss by cancellation in computing, 

Pk = | B k | 2 , (k = 1, 2, • • • , n). (18) 

In the method of diagonal pivots, \B k \ 2 is formed 
from | A k \ 2 by repeated subtractions, and since 

| B k \ 2 = €* 2 | A k | 2 , (19) 


our measure of the figure loss in this method is e* 2 . Thus 
the method of orthogonalization has half the figure loss 
of the method of diagonal pivots. 

R ( X ) as a By-Product of Orthogonalization 
Let Z k , {k — l, 2, • • • , n-\~ 1) be defined by 

Zi = Y ( 20 ) 

and 

k 

Zk+i = Y - L (k = 1, 2, ■ • • , »). (21) 

1=1 

Then 

B k+1 T Z k+1 = B k+1 T Y , (k = 1, 2, • • • , n - 1), (22) 

since the columns of B are mutually orthogonal. Setting 
k = n in (21) and using (22), we have 

•Z.+1 = r - jb BAB?Y)/(B?B,). (23) 

y= i 

Thus Z n+ i is the component of Y orthogonal of S, or 

Z re+1 = - R ( X ). (24) 

The Inverse of A T A After the Application of 
the Method of Orthogonalization 

From (10), (11), and (14), we have 

LA T AL T = D. (25) 

Since L and L T are nonsingular, 

A T A = L- 1 Z2(L r )- 1 (26) 


and 

(. A T A )-! = L T D~ l L. (27) 

Since the calculation of D by the method of orthogo- 
nalization has half the figure loss of the method of diag- 
onal pivots, the former method using (27) yields a com- 
putationally more accurate inverse. The elements of 
this inverse matrix are useful to statisticians in the 
Theory of Error Analysis. 

Details of the Method of Orthogonalization 

The matrix U=L T need not be formed explicitly ex- 
cept in the evaluation of ( A T A)~ 1 by (27). 

Let U k be the matrix which is the nXn identity except 
for the elements in the &th row to the right of the diag- 
onal. These elements are the multiples of the kth. column 
of B which are added to the corresponding following 
columns to yield new following columns, which are 
orthogonal to the &th column. Then 

U = UJJ* • • • U n - 1. (28) 

When: 1) the nonidentity elements of U k have been 
formed and used to orthogonalize the following columns 
and Z k to B k \ 2) B k T Y = B k T Z k has been formed; and 3) 
\B k | 2 has been formed, then Bi is no longer needed and 
the storage cells used for Bi are now available for storing 
the nonidentity elements of U k and the scalar B k T Y 
= {B T Y) k . Repeating this process until k = n, X is 
evaluated by 

X = U 1 U 2 • • • U n -iD~ 1 (B T Z) ( 29 ) 

where we take advantage of all the known zero elements 
of the matrices involved in performing the indicated 
matrix premultiplications. 

For calculation of (A T A)~ l , we have from (27) and (14) 
(. A T A )~ 1 

= (L n —iL n — 2 • • * L i L 1 ) T Dr- l (L n . 1 L n - 2 • • • LHi). (30) 
The nonidentity elements of the product 
Lk(L k -iL k -2 • • • Li) 

may be stored in the locations occupied by the non- 
identity elements of the two factors, (k = 2, ■ • • , n — 1). 
Having formed L, the diagonal and subdiagonal ele- 
ments of (A T A)~ l can be formed and stored in the loca- 
tions occupied by D and L. 

Conclusion 

Although the number of operations involved is greater 
in the method of orthogonalization than in the method 
of diagonal pivots, the increased accuracy is well worth 
the time and effort. It is to be noted that the method 
of orthogonalization for weighted polynomial fitting is 
equivalent to forming a set of weighted orthogonal 
polynomials, fitting the data to these polynomials, and 
reducing the combination of these polynomials to a 
single polynomial in the manner of Tchebycheff. 
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The CORDIC Computing Technique 

JACK VOLDERf 


T HE “Coordinate Rotation Dig ital Computer” 
computing technique can be used to solve, in one 
computing operation and with equal speed, the 
relationships involved in plane coordinate rotation; con- 
version from rectangular to polar coordinates; multipli- 
cation; division; or the conversion between a binary - 
and a mixed-radix system. 

The CORDIC computer can be described as an entire 
transfer computer with a special serial arithmetic unit, 
consisting of 3 shift registers, 3 adder-subtractors, and 
special interconnections. The arithmetic unit performs 
a sequence of simultaneous conditional additions or sub- 
tractions of shifted numbers to each register. This per- 
formance is similar to a division operation in a conven- 
tional computer. 

Only the trigonometric algorithms used in the 
CORDIC computing technique will be covered in this 
paper. These algorithms are suitable only for use with a 
binary code. This fact possibly accounts for their late 
appearance as a numerical computing technique. Matrix 
theory, complex-number theory, or trigonometric iden- 
tities can be used to prove rigorously these algorithms. 
However, to help give a more intuitive and pictorial 
understanding of the basic technique, plane trigonom- 
etry and analytical geometry are used in this explana- 
tion whenever possible. 

First, consider two given coordinate components Y t 
and X{ in the plane coordinate system shown in Fig. 1. 


Y 



The subscript i, as used in this report, will identify all 
quantities with a particular step in the computing se- 
quence. The given components, Yi and X z , actually de- 
scribe a vector of magnitude Ri at an angle 6i from the 
origin according to the relationship, 

f Convair, Fort Worth, Tex. 


Y t = R t sin Oi (1) 

Xi = R t cos 6 Z . (2) 

With a very simple control of an arithmetic unit op- 
erating in a binary code, the sign of a number can be 
changed and/or the number can be divided by a power 
of two. Thus, if it is assumed that the numerical values 
of Y{ and X 8 are available, the numerical values of both 
coordinates of one of the proportional quadrature vect- 
ors, R/, can be easily obtained. 

Y/ = 2 ~’Xi (3) 

X/ = - 2-i Yi (4) 

where j is a positive integer or zero. 

The vector addition of R/ to R z , by the algebraic 
addition of corresponding components, produces the 
following relationships: 

Y i+l = VI + 2r*’'R t sin (0 t - + tan" 1 2 -*) = F,- + 2~'Xi (5) 

X i+1 = VI + 2 -VRi cos (6i + tan" 1 2"0 = X< - 2~»'F,- (6) 

Ri+i = VI + 2-VRi. (7) 

Likewise, the addition of the other proportional quadra- 
ture vector at 6 — 90° to the vector R z produces the fol- 
lowing relationships: 

Y i+ i = VI + 2 -VRi sin (6* - tan" 1 2 ~f) = Y t - 2 -»X< (8) 

X ifl = VI + 2 ~VRi cos (9 Z - tan- 1 2~0 = X; + 2 _/ F; (9) 

R i+ i = VI + 2 -**Ri. (10) 

If the numerical values of the components F* and X z 
are available, either of the two sets of components F;+i 
and Xi+i may be obtained in one word-addition time 
with a special arithmetic unit (as shown in Fig. 2) op- 
erating serially in a binary code. 



Xi+i 

Fig. 2 — Arithmetic unit for cross addition. 


This particular operation of simultaneously adding 
(or subtracting) the shifted X value to F and subtract- 
ing (or adding) the shifted F value to X is termed “cross 
addition.” 
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The effect of either of these two choices can be con- 
sidered as a rotation of the vector Ri through the special 
angle plus (or minus) a; where 

ai — tan -1 2~ } ' (11) 

accompanied by an increase in magnitude of each com- 
ponent by the factor (1 +2 -2j ’)^. 

Note that this increase in magnitude is a function of 
the value of the exponent j and is independent of which- 
ever of the two choices of direction is used. If a particu- 
lar value of j is specified to correspond to a particular 
value of i in the general expression, and if it is specified 
that, for every it\\ term, one and only one of the two 
permissible directions of rotation is used to obtain the 
i-\- 1 terms, then the choice may be identified by the 
binary variable £* where £* = +1 for positive rotation, or 
— 1 for negative rotation. This gives a general expres- 
sion for the ^ + 1 terms as 

Y i+ 1 = VI + 2 ~*iRi sin & + £m) = Y i + &2 (12) 
X, + i = VI + 2 ~VR t cos (6i + UV = Xi - fc2-»Tf (13) 
-R*fi = vi + 2 F^Ri. (14) 

After the components F, + i and X; + i are obtained, an- 
other similar operation can be undertaken to obtain the 
i-\- 2 terms. 

Y i +2 = Vl + 2 _2(3+1) [Vl + 2 ~ 2] R{ sin (0*+ £;«»•+ £;+iOi+i)] 

= F l+ x + &+i2~ (j ’ +1 } X i+ 1 (15) 

Xi + 2 = Vl + 2 _2(,+1) [Vl + 2 ~ 2i Ri cos (0*+£iOi+£i+ia:j+i)] 

= Xi+i — U+N~ u+l) Y i+1 (16) 

R i+ 2 = Vl + 2~ 2(;+1) VI + 2- 2 ^-, ( 1 7) 

Likewise, the pseudo-rotation steps can be continued 
for any finite, pre-established number of steps of pre- 
established increments but arbitrary values of sign. 
After these steps have been completed, the increase in 
magnitude of the vector as a result of these steps will be 
the constant factor 

VI + 2- 2 '- VI + 2~ 2(j ' +1) • VI + 2~ 2(,+2) • • • 

•VI + 2~ 2 "\ (18) 

The effective angular rotation X of the vector system 
will be the value of the algebraic summation of the in- 
dividual rotations. 



X — £icki + £ 20:2 + £ 3^3 + • • 

“F £reOJn 

(19) 

where 





ai = tan -1 2~ ] 


(20) 

and 





£* = + 1 or -1. 


(21) 


Therefore, although the magnitude of each individual 
rotation step is fixed, there now appears the possibility 


that, by an appropriate choice for each £, the algebraic 
summation of all steps can be made to equal any desired 
angle. 

The requirements for making this sequence of steps 
suitable for use with any angle as the basis of a comput- 
ing technique are: 1) a value must be determined for 
each angle a; so that for any angle 6 from —180° to 
+ 180° there is at least one set of values for the £ op- 
erators that will satisfy (19), and 2) these chosen values 
must permit the use of a simple technique for determin- 
ing the value of each £ to specify X. 

The following relationships are necessary and suf- 
ficient for a sequence of constants to meet these require- 
ments : 

180° < + «2 + «3 + • • • -f- a n + a n (22) 

< oii + 1 + a*+2 +•••+«?! + OL n . (23) 

The following sequence meets the requirements o^ 
(22), (23) and (11): 


First term: 

ai = 90° 

(24) 

Second term: 

0:2 = tan -1 2~° = 45° 

(25) 

Third term: 

a 3 = tan" 1 2 _1 « 26.5° 

(26) 

General term: 

a x — tan -1 2- (i_2) (i > 1). 

(27) 


Any angle can now be represented by the expression 
X = £i(90°) + £ 2 tan" 1 2~° + £3 tan" 1 2" 1 + • • • 

+ £ n tan" 1 2-<”~ 2 >. (28) 

The combination of values of the operators £i£ 2 • • • £„ 
form a special binary code which is based on a system of 
Arc Tangent Radices and will be identified as the ATR 
code. The values of a selected for this computing tech- 
nique will be called ATR (Arc Tangent Radix) con- 
stants. 

In addition, only one more term is required in this 
ATR system than that required in a perfect binary- 
radix system for equivalent angular resolution; 

( n — l)th term of perfect binary system ( + variable) 

= 2~ n revolutions (29) 
nth. term of ATR system (for large n ) 

2~(n-l) 

~ revolutions (30) 

7 r 

2-(n-i) 

< 2 - \ (31) 

7 r 

Note that all terms except the first are terms of the 
natural sequence tan -1 2~ J ' (j = 0, 1,2, etc.) and may be 
instrumented as shown in Fig. 2. 

The computation step corresponding to the most 
significant radix is simply 

F 2 = R x sin (d x + £x90 o ) = ^X x (32) 

X 2 = Ri cos (<h + £ x 90°) = - £iF x (33) 
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where 

Rz = Ri. (34) 

This step is unique in that no magnitude change is intro- 
duced. It may be instrumented with the same cir- 
cuitry required for all of the other steps by simply dis- 
abling the direct input to the adder-subtractor during 
this step. 

The change in magnitude of the components resulting 
from the use of all of the terms in the series of (28) is the 
constant factor 

VI + 2-°Vl + 2- 2 Vl + 2- 4 • • • 

. Vi + 2 _2(re_2) . (35) 

The value of this magnitude factor is a function of n 
which can be a constant for any given computer. By 
arbitrarily solving for the factor for n = 24 and by de- 
noting the value of the magnitude change factor as K, 

K = 1.646760255. (36) 

At this point, these individual steps can be fitted into 
a complete computing technique. Of the two basic 
algorithms that will be described here, the problem of 
"vectoring” will be considered first. Vectoring is the 
term given to the conversion from rectangular-coordi- 
nate components to polar coordinates, that is, given the 
Y and X components of a vector, the vector magnitude 
R and its angular argument 0 are to be computed. In 
this technique, R and 0 are computed simultaneously 
and in separate register locations. 

First consider the problem of computing R. Except 
for the known magnitude change K, the Pythagorean 
relationship of the coordinate components is main- 
tained regardless of the value of the summation of rota- 
tion angles, X. Then, if the individual directions of rota- 
tions, can be controlled so that, after the end of the 
computing sequence, the Y component is zero and the 
X component is positive, 

R n + 1 = VX n+1 2 + Y n+1 2 = X n+1 = KR 1 

= KVX i 2 + Fi 2 . (37) 

The technique for driving 0 to zero is based on a 
numerical nulling sequence similar to nonrestoring 
division. 

Since the vector Ri is described only in terms of its 
rectangular-coordinate components, the angle of this 
vector di from the origin (positive X axis) is not known. 
However, if di is considered to be expressed in a form 
so that 

— 180° < di < 180°, (38) 

then it can be shown that the sign of the F* component 
always corresponds to the sign of the angle di. 

Therefore, before each step of the computation, the 
sign of Yi may be examined to determine which of the 
two possible values of £*• will drive d { + 1 opposite in direc- 


tion from 6i and then set the action of the adder- 
subtractor accordingly to obtain the relationship 

| 6i + 1 1 = | | di | — ai | . (39) 

Regardless of the choice for , the shift gates and the 
register gates can be controlled so that the increments 
of rotation prescribed by the sequence of ATR constants 
are used in the same order (most significant first) as 
shown in (24)-(27). 

It can be shown that, by adding another term a n to 
the summation of all ATR constants, the summation 
is greater than or equal to 180° for any value of n\ 

180 ° < ai + £*2 + «3 + ••• + «»» + ot n . ( 40 ) 

By expressing the angle of any vector in the form 
given by (37), the following relationship is obtained: 

| di | < oci + £*2 + a$ + • • • + a n + a n . (41) 

Although it has been previously stated, without proof, 
it can be readily shown that, for the 7th term, 

cti < cti + 1 + a; +2 + • • • + a„ + a n . (42) 

Therefore, if the same rules given for (39) are applied 
to determine d{ + 1 , 

— ax < | | — oti < ck2 + + • • • + a n T oi n . (43) 

Then, by applying the inequality of (42) to the left-hand 
term of the above equation, 

| 02 | = | | 01 | — Oil | < <*2 + <*3 + ' ‘ - + &n + Oi n . (44) 

Likewise, this process may be continued through a n to 
give 

| 0n+l | ^ OL n . (45) 

As an illustration of the step-by-step value of the vec- 
tor, as described by the coordinate components at each 
step during the vectoring operation, consider the exam- 
ple in Fig. 3. 


Y 



Fig. 3 — Step-by-step relationships during nulling. 

If, at the end of the computing sequence, the co- 
ordinate components specify a d i+ 1 equal to zero, the 
total amount of rotation performed was equal in magni- 
tude but opposite in sign to the angle 0i as specified by 
the original coordinate components Fi and X\. 
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At this point another register (identified as the angle 
register) and another adder-subtractor may be intro- 
duced, and it shall be assumed that the numerical value 
of each of the preselected ATR constants is stored with- 
in the computer and can be made available to the arith- 
metic unit in the same order as specified in (28). Since 
each £ controls the action of the cross addition, each may 
also control the action of the additional adder-subtractor 
so that a subtraction or an addition may be made simul- 
taneously in the angle register of the numerical value 
of the corresponding ATR constant to an accumulating 
sum to obtain the numerical value of the original angle 
0i. Then, at the end of the computation, the desired 
numerical value of 0i is in this additional register, and 
the quantity KRi is in the X register. 

A block diagram of the complete arithmetic unit 
necessary for computing both R\ and 0 is shown in 
Fig. 4. 


Yl+l 



Fig. 4 — Block diagram of complete arithmetic unit. 


In summarizing the vectoring operations, the initial 
coordinate components Fi and Xi of a vector are given, 
and the quantities R n+ i and 0i are computed where 

R n+ i = KRi = KV'XC + Fi 2 (46) 

and 

Fi 

0i = tan 1 (47) 

X\ 

and K is the constant magnitude-change factor de- 
scribed in (36) and (37). , 

A typical computing sequence is shown in Table I. 
The two’s complement rotation is used for all quantities 
and for simplicity, shifted quantities are simply trun- 
cated without round-off. 

Next, consider the application of the previous step- 
by-step relationships for the solution of the problem of 
coordinate rotation. It will help in applying this tech- 
nique to consider, as before, the coordinate system as 
being fixed and the vector as being rotated. Then the 
solution for coordinate rotation requires the solution 
of the numerical values of the two coordinate com- 
ponents of a vector that has been rotated by a given 


TABLE I 

Typical Vector Computing Sequence 


F Register 

X Register 

Angle Register 

Fi= 0.0101110 

1. 1000101 = Xi 

0.0000000 


-1.1000101 

+0.0101110 

+0.1000000 

tan -1 co 

0.0111011 

0.0101110 

0.1000000 


-0.0101110 

+0.0111011 

+0.0100000 

tan -1 1 

0.0001101 

0.1101001 

0.1100000 


-0.0110100 

+0.0000110 

+0.0010010 

tan” 1 2” 1 

1.1011001 

0.1101111 

0.1110010 


+0.0011011 

-1.1110110 

-0.0001001 

tan” 1 2” 2 

1.1110100 

0.1111001 

0.1101001 


+0.0001111 

-1.1111110 

-0.0000101 

tan” 1 2” 3 

0.0000011 

0.1111011 

0.1100100 


-0.0000111 

+0.0000030 

+0.0000010 

tan” 1 2~ 4 

1.1111100 

0.1111011 

0.1100110 


+0.0000011 

- 1.1111111 

-0.0000001 

tan” 1 2” 6 

1.1111111 

0.1111100=^1 

0.1100101 = 

e 


angle X from its original position, as defined by the given 
initial coordinate components Fi and X\. 

The desired angle of rotation will be given in binary 
coded form. By placing the angle X in the angle register 
and sensing- the sign of the quantity in this register be- 
fore each step, the quantity in the angle register may 
be nulled to zero by sequentially subtracting or adding 
each of the ATR constants to the remaining quantity. 

The explanation and proof of this nulling operation, 
in which the actual numerical values of the angles are 
employed, is the same as that for the previous nulling 
operation for vectoring. 

Immediately following the determination of each 
ATR digit, and concurrently with the operation of the 
subtraction or addition nulling operation in the angle 
register, the operation of cross addition of shifted quanti- 
ties may be performed in the F and X registers to 
rotate the vector in a direction determined by each £ 
with an angular magnitude as specified by ct* corre- 
sponding to the ATR constant being used. Then, at the 
end of the computing sequence, the numerical values 
of the desired components Y n+ 1 and X n+1 will be in the 
F and X registers, respectively. 

In summarizing the coordinate-rotation operation, 
the initial coordinates F* and Xi and the desired angle 
of rotation Xi are given where 

Y i = Ri sin 0x (48) 

Xi = Ri cos 0i. (49) 

The results available at the end of the computation 
sequence are 

F n+ 1 = Rn+1 sin (01 + Xi) = KRi sin (0i + X x ) (50) 

X n -\-i R n + 1 cos (0i + Xi) = KRi cos (0i + Xi) (51) 

where K is the same constant magnitude-change factor, 
given in (36) and (37), that applies for the vectoring 
operation. 
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A typical coordinate rotation computing sequence is 
shown in Table II. (Note that in this computation, the 
vector is rotated through the negative X axis.) 

Since the change of magnitude will be exactly known 
beforehand, it may be compensated for exactly either 
by scaling or by a magnitude-correction multiplication. 
It may then be said that, except for the practical con- 
sideration of limiting the number of digits and the 
number of steps to some finite value, both algorithms 
produce an exact solution. In a practical computer, no 
approximations are necessary except round-off. 

In applying this computing technique to practical 
problems, the complete solution may be programmed 
by considering the computer as the digital equivalent 
of an analog resolver. 

If the analog-resolver solution flow for the particular 
problem is known, then the number of operations and 
the information-flow diagram can be obtained simply 
by substituting for each resolver, on a time-shared basis, 
a CORDIC operation. 


TABLE II 

Typical Rotation Computing Sequence 


Y Register 

X Register 

Angle Register 

Fi= 0.0101110 

1. 1000101 = .Yi 

0.1100101 = 

X 

+1.1000101 

-0.0101110 

-0.1000000 

tan -1 co 

1.1000101 

+1.1010010 

1.1010010 

-1.1000101 

0.0100101 

-0.0100000 

tan -1 1 

1.0010111 

+0.0000110 

0.0001101 

-1.1001011 

0.0000101 

-0.0010010 

tan -1 2 -1 

1.0011101 

-0.0010000 

0.1000010 

+1.1100111 

1.1110011 

+0.0001001 

tan -1 2 -2 

1.0001101 

-0.0000101 

0.0101001 

+ 1.1110001 

1.1111100 

+0.0000101 

tan -1 2 -3 

1.0001000 

+0.0000001 

0.0011010 

-1.1111000 

0.0000001 

-0.0000010 

tan -1 2 -4 

1.0001001 

-0.0000001 

0.0100010 

+1.1111100 

1.1111111 

+0.0000001 

tan -1 2~ 5 

1.0001000 

0.0011110 

0.0000000 



Monte Carlo 


Calculations in 


Statistical Mechanics 


W. W. WOODf AND J. D. JACOBSON f 


I. Statistical Mechanical Introduction 

ACCORDING to classical statistical mechanics, the 
f—\ thermodynamic properties of a system of N 
molecules at temperature T and volume V are 
obtainable from the Gibbs configurational phase inte- 
gral, 


given by (1) is the associated normalizing factor. For 
example, the pressure p is given by the average of the 
“virial” Vr of the total intermolecular force: 

pV/NkT = 1 - (1 /3NkT){V R ), (3) 

where 


Z n (T, V) = f ' N f e~ u ' kT dry ■ ■ ■ dr N , (1) 

J v J V 

where k is Boltzmann’s constant; U is the potential 
energy of the system of N molecules, and is a function 
of the position vectors r a , a = 1,2, • • • , N, of the mole- 
cules. For suitably simple molecules one usually as- 
sumes U to be expressible as a sum of spherically sym- 
metric pair interactions u(r ) : 

U{r h • • • t n ) - — 2 Yj' ( 2 ) 

2 a— 1 |3=1 

where r a p = \\rp — r a ||, and the prime indicates omission 
of terms for which a=(3. 

Most of the thermodynamic functions are expressible 
in terms of “ensemble averages” of some related function 
of the configurational coordinates. In these averages the 
factor e~ u/kT appears as a weighting factor, so that Z N 

f Los Alamos Scientific Lab., Los Alamos, N. M. 


{V r) = (1/Ztf) Jv N j yl/2 r a0 du(r a P)/dr a ^jx 

(4) 

e ~ u / kT dry ■ ■ ■ drN 

The “radial distribution function” g(r) is of consider- 
able importance in the study of fluids. Let us first define 
n{r ), the “cumulative radial distribution function,” 
giving the average number of molecules lying within the 
distance r from any representative molecule: 

n{r) = (l/iF)^E' r)^, (5) 

where A(x, r ) is the step function 

A(x, r) — 1 0 < x < r) 

> • ( 6 ) 

= 0 r S xj 

Then g{r), which is the number density of molecules at 
the distance r from any representative molecule, rela- 
tive to the over-all macroscopic density N/ V, is given by 



262 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


g(r) = ( V/4wNr*)dn(r)/dr . (7) 

In terms of g(r ) (3) can be written 
2 t t N 1 r°° 

pV/NkT = 1 ~ - — — ru'(r)g(r)rHr. ( 8 ) 
3 F kT J o 

Although more involved potential functions u(r) are 
required to predict the thermodynamic properties of 
real molecular systems [l], there is much theoretical 
interest in the simple hard sphere (sometimes called bil- 
liard ball, or elastic sphere) model for which 

u(r ) =oo r < a , 1 

9 

= 0 r ^ a; 

a is the molecular diameter. This form of u(r ) also leads 
to considerable computational simplification in the 
Monte Carlo procedure, and we shall refer most of our 
discussion to this case. Eq. (8) then becomes 

2-ir N 

pV/NkT = 1 + — — 

= 1 + an' (<r)/ 6; (10) 

the pressure depends only on the value of the radial dis- 
tribution function at the surface of the molecule. 

II. Theoretical Basis of the Monte 
Carlo Method 

For a macroscopic system, F^l cm 3 , N is of the order 
of 10 22 , which is a considerable obstacle to either ana- 
lytical or numerical treatment. Just as the various 
theories in one way or another radically reduce the di- 
mensionality of the problem, a similar reduction is 
necessary before undertaking numerical work. Here this 
is done by applying the periodic boundary condition to 
a basic system of perhaps a few hundred molecules. The 
basic volume V is taken to have a shape such that it 
fills space compactly under successive translation along 
three axes in space. There are N molecules in each 
replica V, in the same relative positions, and in evalu- 
ating U from (2) the sum over a is limited to the N 
molecules in a particular reference V, but the sum over 
is allowed to extend over the adjacent replicas out to 
some appropriately determined cutoff point. Particu- 
larly at high densities, near close-packed, it is important 
that N be chosen so that a regular lattice of N points 
fixed in V generates under the above translations the 
corresponding regular lattice throughout space. Most 
of our work has used a cubical V for which the natural 
choice of lattice is cubic-close-packed (face-centered 
cubic); then the possible choices of N are 4 n 3 , n = 1, 
2, • • • , or 2V = 4, 32, 108, 256, 500, • • • . N=32 is the 
first reasonably realistic choice, and most of our calcu- 
lations have been done for this value. The periodic 
boundary condition is illustrated in Fig. 1 for a two- 
dimensional square V containing four molecules. 

For hard spheres pV/NkT can readily be shown to 
be a function only of the reduced volume parameter r, 
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Fig. 1 — The periodic boundary condition in two dimensions 
with four molecules in a square V. 


defined by 


N 

V<tW2 ’ 


(ID 


(The parameter so defined is the ratio of the volume V 
actually occupied by the system to the volume occupied 
at regular close packing.) For finite values of N, with 
the periodic boundary condition, some additional de- 
pendence upon N, for fixed r, may be expected, and one 
object of the investigation should be to estimate this 
dependence and perhaps extrapolate to N = <*> , since 
it is this limiting value which will correspond to the de- 
sired thermodynamic value. 

By use of the periodic boundary condition the prob- 
lem has been approximated by one of a few hundred 
degrees of freedom, which is in the region where ana- 
lytical methods fail but where Monte Carlo methods on 
electronic calculators may be expected to be feasible. 
However, the most obvious Monte Carlo procedure, the 
evaluation of Zn as given by (1) by means of independ- 
ent sampling of points in 3N dimensional configuration 
space, proves not to be feasible at interesting values of 
V/N [2], The method to be described here is due to 
Metropolis, et al. [3], and involves instead a Markov 
chain whose states are points in 3N dimensional con- 
figuration space. A Markov chain may be regarded as 
the next level of stochastic complexity beyond the sim- 
ple series of independent events such as a series of 
throws of a die, or the usual Monte Carlo integration 
procedure just mentioned. In contrast to the latter, in 
a Markov chain the probability of obtaining a given 
result at the tt h step depends explicitly on the result on 
the ^ — 1th step (and therefore implicitly on all preceding 
steps). The simplest example is the well-known random 
walk, and the present method is in fact a random walk 
in 3N dimensional configuration space. The Markov 
chain method usually does not result in an evaluation of 
Zn (1), but instead permits the direct evaluation of con- 
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figurational averages such as appear in (4) and (5). 1 * The 
Markov chain is so constructed that the average of any 
function of configuration over all states in the chain 
tends toward the corresponding ensemble average [e.g., 
(4) ] as the chain is developed indefinitely. 

The discussion of Markov chains in which the admis- 
sible states form a discrete, finite set is somewhat 
simpler than that required for the case of a continuum 
of states, and we shall therefore take advantage of the 
fact that the use of digital calculation itself imposes a 
“quantization” of configuration space determined by 
the number of binary bits (in our calculations, 17) used 
to specify the molecular coordinates. Let us then repre- 
sent possible configuration states by the letters j, k, and 
l, and let p(j, k) be the probability, if at time t the con- 
figuration state is j, that at time t-\- 1 the state will be k. 
To describe the development of the chain we use the 
usual “time” terminology; it should be emphasized that 
this is not a time directly related to molecular times, but 
may conveniently be thought of as corresponding to 
calculating machine time. The p(j, k) are independent 
of t, and for the desired averaging property must satisfy 
the following conditions [l ] : 

£ PU, V = i, (12) 

k 

and 

p(J , k)e- u ^ kT = p{k,j)e~ u ^‘ kT . (12a) 

In addition, there is an ergodic condition on the matrix 
p(j, k ) : all states included (with nonzero weight) in the 
integral in (4) must belong to a single ergodic class [5]. 
That is, for every pair of such states j, k there must exist 
at least one finite n such that p M (j, k) ^0; p w {j, k) is 
the probability of passing from state j at time t to state 
& at time v 

p a) (j, k) = p(j, k ) ; p M (j, k) = 2 p (n ~ l) (j, l)P(l,k). (13) 


Among the p(j, k) matrices satisfying (12) and (12a) 
we use the following: 

p(j,k) = A(j,k) if U(k) S 1700; 1 


= A{j,k) exp [-(£/(*) - U(j))/kT], 


itk j; (14) 


if U(k) > U (j ) ; J 


/'O’, i) = 1 - £' p(j, k), 

k 


(15) 


where 


A(j,k) = 1/8A5 3 , if r(a; k) = r(a;j), 

a = 1, 2 • • .0 - 1,0+ 1 • • • A; 
and r(0, k) G C 9 [r(p;j)], (3 = l, 2, ■ - • N; (16) 
= 0, otherwise. 


1 For a case in which the Monte Carlo method leads to an evalua- 

tion of Zn, see Salsburg, et al. [4J. 


In (16) Cs [r(/3, j) ] denotes a cube of half edge 8, oriented 
parallel to the coordinate planes, with center at r(/3, j ) ; 
the latter notation means the position vector of molecule 
in configuration j. The length 8 is a parameter of the 
Monte Carlo process whose value is found to influence 
the rate of convergence of the process. It is ordinarily 
small (see Section IV) compared to the edge of the cube 
V, and in (16) it is expressed in units of the smallest 
increment in the molecular coordinates; in our calcula- 
tions, therefore, in units of 2 -17 times the edge of the 
cube V. 

The motivation underlying this specification of 
p(j, k) is as follows. The factor A(j, k) corresponds to 
a uniform choice among the limited number of neighbor- 
ing states contained in Cs; this limitation is related to 
the small mean free path in molecular systems at inter- 
esting densities. The unsymmetrical form of the Boltz- 
mann weighting factor (14) is chosen in preference to 
the alternative symmetrical form 

e -U(.k)/kT 

p(j,k) = A(j,k) (17) 

rJ g — U (?) IkT _|_ e -U(k)lkT V ' 

mostly for computational convenience, since it leads to 
fewer calculations of the exponential function (in the 
case of more complicated models than hard spheres), 
but also because the unsymmetrical form leads to a 
more rapid motion in configuration space. 

For hard spheres (14) reduces to 

p(j,k) = A(j, k), if U{k) = 0, 

= 0, if XJ{k) = oo, (18) 

assuming U(j) =0, the only case with which we will be 
concerned since the initial state of the chain has U — 0. 

It can be easily shown that if u(r) has only point in- 
finities (say, at r = 0), then p(J, k ) defined above also 
satisfies the ergodic condition. However, the hard sphere 
u(r) is infinite over a finite interval in r, with the result 
that U in 3N dimensional space is infinite over regions 
of considerable extent, from which the state point repre- 
senting our system is excluded. The topological ques- 
tion of whether the accessible region in which U is zero 
is then connected or not, depending on the values of 
N/V and <r, is one to which the answer is unknown, and 
as a result the equivalence of the Markov and “en- 
semble” averages is somewhat doubtful. Under such 
circumstances the p{j , k ) defined above may separate 
the configuration states into separate ergodic classes. 
Within each class (i.e., depending on the choice of 
initial configuration) the Markov average will converge 
to an average equivalent to the ensemble average taken 
over a restricted region of configuration space. However, 
it should also be mentioned that the p(j, k ) defined by 
(14) and (15) may actually connect spatially discon- 
nected regions of configuration space, if the thickness of 
the dividing barrier is not too large compared to the 
parameter 8. (This is the only circumstance in which 
the value of 8 will affect the convergent average of the 
Markov chain.) 
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Such a compartmentalization of configuration space 
also raises the well-known quasi-ergodic problem in sta- 
tistical mechanics concerning the equivalence of “en- 
semble” averages to the time-average behavior of an 
actual physical system of N molecules moving according 
to the Newtonian equations of motion for the given 
potential function. In this connection we call attention 
to the work of Alder and Wainwright [6], in which these 
equations of motion for the same systems of hard 
spheres with periodic boundary conditions have been 
integrated. Thus it is possible to compare the results of 
this kinetic theory calculation with the present statis- 
tical calculation. In a later part of the paper some com- 
parisons of this sort will be made. 

As mentioned already, the topological question of the 
connectivity of configuration space has not been solved. 
It is quite evident, for example, that at sufficiently high 
densities, near close packed, it will not be possible to 
interchange the positions of two molecules without 
crossing a region in which U is infinite. Configuration 
space is then evidently compartmentalized into some- 
thing like N\ disconnected regions. However, for sys- 
tems of N identical molecules (i.e., a one-component 
system) this compartmentalization is of no consequence 
since all ensemble averages have identical values in all 
such compartments (i.e., for hard spheres, the volumes 
of the compartments are all the same). It is the question 
of whether, associated with this trivial partitioning of 
the configuration space, there is a further, nontrivial 
partitioning into regions of different properties, which 
is uncertain. In particular, for example, if such a non- 
trivial compartmentalization does occur, how does it 
depend on the choice of N, at constant N/V and <r? Is it 
an artifact of the periodic boundary condition? We will 
have occasion to return to these questions when we 
describe below the results obtained with the method. 

III. Calculation al Procedures 

It is convenient to take as unit of distance the edge 
of the cubical volume V, so that in these units V=l. 
For a given choice of N, specification of r then deter- 
mines the value of a, by (11). We first outline briefly, 
then discuss in greater detail, the steps by which the 
calculator is caused to develop a particular sample chain 
according to the Markov process defined by (14)— (16) , 
for a system of hard spheres. 

1) Assume that at “time” t the system is in the state 
j(t), corresponding to subscript^ in (14)-(16). 

2) A random choice of one of the N molecules is made, 
corresponding to subscript (3 in (16). We call this mole- 
cule (3(t). 

3) Each coordinate of molecule (3(t) is given a tenta- 
tive random displacement, uniform on the interval 
(—5, 5), corresponding to r((3, k ) in (16). Call the con- 
figuration in which molecule fi(t) is in this displaced 
position, and the other N— 1 molecules a have position 
r(a, j(t)) configuration j'(t). 


4) Configuration j'(t) is tested for overlaps; i.e., one 
or more pairs of molecules whose distance between 
centers is less than the molecular diameter a. 

5) a) If no overlap is found in Step 4, then the next 

configuration in the chain is j'(t); i.e., =j' (t) . 

b) If configuration f (t) contains an overlap, the 
configuration at time tfi- 1 is identical with that at time 
t:j(t + l) =j(t). 

6) Certain procedures concerned with the averaging 
process are performed; the procedure then repeats be- 
ginning at Step 1, except for occasional interruptions 
for checking and census procedures. We now describe 
each of these steps in greater detail. 

A. Specification of a Configuration 

Here we describe the information carried in the calcu- 
lator memory for purposes of specifying a particular 
configuration. Fundamentally all that is required are 
the values of the 3N coordinates r(a), which are stored 
in the r-Table. For economy of calculating time, how- 
ever, it is desirable to carry along additional redundant 
information, since the calculation requires a rather 
large amount of machine time. For example, in con- 
nection with the calculation of the “cumulative radial 
distribution function” n(r ) (5), it is desirable to have 
available for each configuration j(t) a tally, called the 
C-Table, of the intermolecular distances (squared) r a fi 
into the Nr intervals C(v) = [cr 2 +j'-2~ s , o" 2 + (v + l)2 _s ], 
^ = 0,1,-*-, Nr — 1. Here s is chosen to be an integer 
in order to expedite the tallying process. When we are 
interested only in estimating the pressure, using (10), 
so that only g(<r) is required, the range of the C-Table 
can be made quite small, with a consequent economy of 
machine time, as will appear below. 

The search for overlaps in Step 4 can be considerably 
expedited by a device which avoids the necessity of 
examining all N— 1 values of r a fi. 2 The basis for the de- 
vice is the fact that the displacement parameter 8 is in 
all interesting cases relatively small compared to unity 
(edge of V ), so that any actual overlap r a p which occurs 
must be with a molecule a which was fairly close to / 8 in 
configuration j(t). We proceed as follows. At some point 
to in the chain (in particular, at its beginning t = 0) we 
establish a set of tables called the ju (a) -Tables, one for 
each a = l, 2, • • • , N. Each ju (a) -Table consists of an 
indefinite number of entries fx(a, 7 ), 7 = 1, 2, • • • , and 
gives all molecule numbers p(a, 7 ) for which r a P(a,y) <dfi 
in configuration j(to) . The parameter d\ must satisfy the 
relation 

dP > dp = o 2 + 2 ~°Nr, (19) 

but is otherwise arbitrary, and can be adjusted for opti- 
mum calculation rate. Also at to the positions of all N 

2 Note that since configurations j'(t) and j(t ) differ only in the 
coordinates of molecule jS, only these N— 1 distances have changed. 
Thus if j(t) is free of overlaps (which will be the case unless an error 
has occurred at some previous value of t), any overlap in j'(t) will 
occur among this set of N— 1 interactions. 



Wood and Jacobson: Monte Carlo Calculations in Statistical Mechanics 


265 


molecules are recorded in the r 0 -Table; we shall call the 
configuration specified by this fo-Table “configuration 
jo.” A simple application of the triangle inequality 
shows that as long as all molecules remain within a 
distance 

d = (di- d z )/2 (20) 

of their fo-Table positions, only those molecules in the 
Ijl ( a) -Table can approach closer than d 3 to molecule a. 
Thus as long as this “diffusion condition” is satisfied we 
need test in Step 4 only the molecules contained in the 
„[/3(/)]-Table.° 

The recent availability of considerably larger core 
memories on the Los Alamos 704 calculators has made 
it possible to consider carrying other redundant informa- 
tion in order to increase further the calculation rate. 
Some of these possibilities are mentioned later in the 
discussion. 

The starting configuration most often used is a regu- 
lar face-centered lattice arrangement. 

B. Choosing Molecule (3(t) 

This is done by the usual technique of taking the in- 
teger part of the product %N, where £ is a nominally 
randomly, uniformly distributed variable on the interval 
[0, l], obtained from a sequence of pseudo-random 
numbers generated by the middle square process [7]. 
We use a sequence of 70 bit numbers (generated by 
double precision techniques) which has not been sta- 
tistically analyzed except to verify that there are no 
repetitions among the first 10 6 70 bit numbers. Faster 
techniques [8] can be used to generate pseudo-random 
numbers, and are currently being incorporated in the 
codes. The generation of all the random numbers re- 
quired in the complete calculation uses at most about 
7 per cent of the calculation time, however, so that no 
large improvement can be made by this change alone. 

C. Displacing Molecule (3(t) 

The procedure is similar to that in Step 2, the dis- 
placed position being given by 

r[P(j)\j'(f)] = 4/3(0 ;i(0] + (21) 

where the three components of ^ are independent ran- 
dom numbers uniform on the interval ( — 1, 1). We 
may mention that as a consequence of the periodic 
boundary condition, if molecule fi{t) due to the displace- 
ment given by (21) leaves the volume V through one 
face, it re-enters V through the opposite face; thus the 

3 Actually, one might set up two M(a)-tables, mi(«) giving inter- 
actions which are potential overlaps with a [entries being determined 
by di =<r+2d (22)]; /«(«) defined as in the text, giving potential 

C-Table interactions. This might be worthwhile when d 3 is relatively 
large compared to <r [for example, if g(r) is to be calculated over a 
range of several molecular diameters]. In the present investigations 
we have concentrated on determining the pressure p\ as we have 
seen since only g(<r) must then be determined, d 3 may be quite close 
to a- (we use d 3 = 1.05 <r), and not much advantage would be gained 
from the additional complication. 


number of molecules N contained in V remains con- 
stant (see Fig. 1). 

Steps 2 and 3 together require about 19 per cent of 
the average calculating time per configuration, with the 
present code. 

D. Testing j' (t) for Overlaps , and Tallying its Interactions 
into the C-Table 

The distances 3 4 \\r [(3(t) ; j'(t) ] -r[n(fi{t), y); j(0]|| are 
computed for 7 = 1, 2, • • • ; if for any 7 an overlap is 
found, control passes immediately to Step E2. Those 
distances less than d 3 are tallied into an E-Table whose 
structure is identical with that of the C-Table already 
described. If none of the entries in n[fi(t )] produce an 
overlap, the diffusion test is made to see whether 
r [1 ®(t),j'(t)] is within the distance d of the fo-Table posi- 
tion stored for molecule /?. If it is, control passes im- 
mediately to Step El. If the diffusion condition is 
violated, control enters a routine called “refresh” which 
has the following functions: 

1) All N— 1 interactions ||r [(3(t) ; j'(t) ] — r [«; jo]|| for 
a^^{t) are calculated, and the values of a for which 
these distances are less than di are entered in a tempo- 
rary ITVTable, which may eventually replace the cur- 
rent fx jj 8(t) ]-Table. 

2) As the procedure described in 1) is carried out, the 

distance ||r ; j'(t) ]—r[a; j(*)]|| is also calculated for 

every value of a which is placed in the HA-Table but 
which is not found in ji [fi(t) ]. If any of these interactions 
in configuration j'(t) is found to be an overlap, control 
passes immediately to Step El. Those which lie in the 
interval (a, d s ) are tallied into the E-Table already 
mentioned. 

3) Those values of a which are entered in the W 2 - 
Table, but which are not contained in n[(3(t)] are tabu- 
lated, and if no overlap is found in j'(t), the entry j Q(t) 
is inserted in p(a) for these values of a. 

4) Those values of a which are not entered in the 
TFVTable, and which appear in n[(3(t)], are tabulated, 
and if no overlap is found in j'(t), the entry (3(t) is re- 
moved from /jl(o) for these values of a. 

5) If no overlap is found in j f (t), r[/3(/) ;j'(t)] replaces 
r[/3(t) ; Jo], and control passes to Step El. 

The testing of j'(t) for overlaps, exclusive of the time 
spent in the “refresh” routine, is the most time-consum- 
ing part of the calculation, using about 46 per cent of 
the total time; the “refresh” routine uses about 16 per 
cent of the time. These figures are for values of the 
parameters controlling the calculation rate which seem 
to be optimum (see below). 

El) No Overlap inj'(t) 

In this case, the next configuration of the chain, 
1), is configuration j’it). The necessary modifica- 

4 When we speak of computing a distance between two molecules 
we always mean the computation of the square of the distance; no 
square root procedures are required. 
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tion of the fo-Table and of the ju-Tables has been de- 
scribed under Step 4. The only further complication 
appears in the modification of the C-Table. Evi- 
dently what is required is to remove the interactions 
I! * [£(*); j(t)] — r[/x(/3(0, y); j(t)]\\ which have been 
tallied into the C-Table at earlier t, and replace them 
by the interactions tallied into the E-Table during 
Step 4. The present code requires the recomputation 
of these j(t) interactions at this point; current improve- 
ments will add to the redundant information already 
carried a set of ^(a) -Tables recording for each y) 

interaction in j(t) the corresponding interval in the 
C-Table. In any case, the procedure consists of a tally 
of these j(t ) interactions into the E-Table, but with a 
negative unit increment; at the Conclusion of this pro- 
cedure the E-Table then contains the desired changes in 
the C-Table, so that the next procedure is C+E— >C. 
The modification of the r-Table is simply r [/?(£) ; j'(t) ] 
— >r[j 8(t);j(t)]. If the “refresh” procedure for modifying 
the /x-Tables was required in Step 4, we now replace 
ju [/?(£)] by the TT 2 -Table described there. This delay is 
necessary since n[(3(t)] must be used for the computa- 
tion of j(t) interactions. 

We have so far not established a good criterion of the 
rapidity of the convergence process, other than simply 
a visual inspection of the trend of the results. In order 
to obtain some quantitative criterion for the choice of 
the parameter 8, we calculate the sum of the squares of 
the distances in 3 N dimensional space between succes- 
sive configuration points: 

m = e \\rw)-,m) - - D)ll ! - (22) 

f' = 1 

V 

At present about 7 per cent of the calculation time is 
spent in Step El. 

E2) Overlap in j' (t) 

In this case =j(t), no changes in the tables are 

required, and there is no contribution to <p(t). 

F. Census and Checking Procedures 

For convenience in monitoring the progress of the 
calculation, averages of the C-Table are taken over suc- 
cessive intervals of fixed length in time t, as well as over 
the entire length of the chain. For the former purpose an 
A -Table is carried which is set to zero at the beginning 
of each time interval, and at successive time steps is 
incremented by the C-Table. For the over-all average an 
C-Table is carried which is augmented by the A-Table 
at the end of each time interval. The cumulative distri- 
bution function n(r), (5), is then estimated by 

n(P + v • 2 s ) = — S S(v'; t). (23) 

Nt „' = o 

Numerical differentiation (performed later by hand) 
then gives g(<r), and the pressure p. 


Numerous checks are built into the problem, particu- 
larly in the “refresh” routine in Step 4, and at the 
census intervals just described, where the current 
C-Table is checked by calculating all N(N — 1)/2 inter- 
actions. These procedures have proven to be very 
worthwhile in giving notice of the occurrence of machine 
errors. 

About 12 per cent of the calculation time is involved 
in these procedures. 

IV. Results and Discussion 

The Monte Carlo method was previously applied to 
the system of hard spheres by Rosenbluth and Rosen- 
bluth [9]. However, the results obtained by Alder and 
Wainwright [6] at an early point in their kinetic investi- 
gation suggested that the Rosenbluth results might be 
partly in error, so that we undertook a concurrent re- 
investigation of the question. The new results shown in 
Fig. 2 are indeed in rather good agreeement with those 
of Alder and Wainwright. 

The new Monte Carlo results have been described 
briefly elsewhere [9] from the standpoint of their sta- 
tistical mechanical interest. Here we shall consider them 
briefly from the standpoint of computational interest, 
concentrating on aspects of the behavior which are in 
some sense exceptional. 

Plotted as the reduced pressure pVo/kT vs the re- 
duced volume r, where z> 0 is the close-packed volume per 
molecule (i> 0 = <r 3 /-\/2) , the results fall on two distinct 
curves, as shown in Fig. 2. This behavior may well be 
related to the presence of a first-order phase transition ; 
we leave aside this point in the present discussion. In 
the region r<1.6 the figure shows do able- valued pres- 
sures. In the region r = 1.52-160 these arise because of 
the behavior of a representative single chain shown in 
Fig. 3. These chains show in addition to the usual sta- 
tistical fluctuation, a tendency to oscillate between two 
rather well-defined classes of states, with interclass 
transitions being so rare (about four hours of 704 time 
were required to generate the chain pictured in Fig. 3) 
as to make the over-all averages very poorly determined 
within a single chain, and differing widely in independ- 
ent chains generated for the same values of N and r. If 
the two classes are averaged separately, however, the 
inter- and intrachain agreement is quite good; it is 
these separate averages which are plotted in Fig. 2, and 
which give rise to the double-valuedness. It is note- 
worthy that Alder and Wainwright observe the same 
rare interclass transitions. 

In the course of investigating the effect of varying the 
parameter 8, we generated 14 different Markov chains 
for the 32-molecule system at r = 1.55, the chain shown 
in Fig. 3 being one of these. These chains comprised a 
total of 12,800,000 configurations, the longest chain in 
the set containing 3,500,000 configurations and the 
shortest, 560,000 configurations. A total of 14 well- 
defined interclass transitions were observed ; four chains 
produced no transitions, eight produced one transition, 
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• Monte Carlo, N—32 
▲ Monte Carlo, N= 256 
+ Alder and Wainwright [6], iV = 32. 

Fig. 2 — For the sake of clarity many of the calculated points have been 
omitted from the figure, particularly those of Alder and Wain- 
wright for N— 108 and 256. 



Fig. 3 — The number of molecules inside a sphere of radius 1.026 a 
around a representative molecule, averaged over intervals of 
19,200 configurations, plotted against the total number of con- 
figurations t, ina representative chain at t = 1.55, N=3 2, 5 = 0.051. 
The over-all average of these points should converge to the cumu- 
lative distribution function n (1.026 tr), but the convergence is 
evidently very slow due to the secular fluctuation between the 
groups of high- and low-pressure states. 

and two chains showed three transitions. Grouped all 
together, the results suggest that a transition occurs on 
the average of about once in 900,000 configurations. 
These 14 chains gave values of pv 0 /kT (averaged over 
the entire length of each chain, not the separate high- 
and low-class averages mentioned previously) ranging 
from 5.45 to 6.92, with the average over all 14 chains 
being 6.23. This value of the average is obtained for two 
different weightings: 1) each configuration of every 
chain is given equal weight; or 2) each chain is weighted 
with its final value of <f>(t) (22). The calculated standard 
deviation of this average is 0.11. We do not wish to 
stress the significance of this average, though it is inter- 
esting that it lies midway between the two curves of 
Fig. 2. 


We interpret this transition behavior to be due to a 
compartmentalization of phase space (see the discussion 
of this point in Section II) into two regions which are 
narrowly connected in the region 1.52 <r< 1.60. At 
larger values of r the distinction between the two 
regions disappears rather quickly, and the over-all aver- 
ages are easily determined. At smaller values of r 
neither we nor Alder and Wainwright have observed any 
interclass transitions, and it is not known whether the 
two regions of configuration space corresponding to the 
two branches of Fig. 2 are in fact connected or not for 
r<1.52: the lower curve is obtained when the initial 
configuration is taken as the regular lattice arrange- 
ment; the upper curve was obtained from initial con- 
figurations generated by a step-wise compression of the 
system ( i.e ., gradually increasing a as t increases), start- 
ing from a randomly chosen configuration from the up- 
per class in the region r = 1.55-1.60. The behavior does 
not change as N is varied, and is therefore probably 
not an artifact of the periodic boundary condition. 

For optimum calculation time it has been found best 
to choose di/a~ 1.3 for N = 3 2 , ==1.45 for N = 256, both 
for t~ 1.5. The choice of this parameter does not affect 
the sequence of configurations generated in the chain, 
but only the rate (in terms of machine time) at which 
they are generated. The parameter 8, on the other hand, 
affects both the calculation rate and the motion through 
configuration space. The dependence of the optimiza- 
tion parameter [<t>(t)/t] 1/2 /9 on the choice of 8 is shown 
in Fig. 4, for the 32-molecule system at r = 1.55. Here 9 is 
the average actual machine (IBM 704) time required 
for the generation of one configuration in the chain, and 
[4>(t)/t] 112 (taken for large t) is the root mean square dis- 
placement per configuration. Thus the maximum in 
Fig. 4 corresponds to a maximum movement through 
configuration space per unit of machine time, on the 
basis of the root-mean-square displacement criterion ; it 
occurs at a value of 9=20 milliseconds/configuration. 
Also shown in Fig. 4 is the behavior of the “reject ratio” 
R, which is the fraction of recurring configurations due 
to overlaps in j'{t). It is rather striking that the appar- 
ent optimum calculation rate occurs at a rather large 
value of i?(~0.90). So large an optimum value of R was 
something of a surprise to us, but is at least partially 
supported by our qualitative impressions of the con- 
vergence of the radial distribution function, and prob- 
ably also by the frequency of interclass transitions, 
which appears to be greatest at similarly large values 
of R. 

It is of some interest to inquire how the rate at which 
the 704 calculator moves the state point through con- 
figuration space compares with that at which an actual 
molecular system executes its thermal movement, even 
though the two processes are of radically different char- 
acter. If we take the temperature to be 25°C, the molec- 
ular diameter a = 3.5 X 10 -8 cm and the molecular mass 
as 40 atomic weight units (values which are approxi- 
mately correct for the argon molecule), we find that a 
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(a) 


i 

e 



(b) 

Fig. 4 — The reject ratio R and calculation rate \Z<j>/t/0 (in units of 
the linear edge of the Monte Carlo cell V, per second of IBM 704 
calculator time) as functions of 5, for N=3 2 and r = 1.55. In (a) 
the dashed curve is from the high pressure class of states, the 
solid curve from the lower pressure class. Similarly in (b): • lower 
pressure class; X high pressure class. 


system of 32 molecules under these conditions would 
have a rate of root-mean-square displacement about 
3 X 10 12 times the maximum in Fig. 4(b). Thus we should 
not be surprised at the rareness of the interclass transi- 
tions during the machine calculation — on the scale of 
physical time they are relatively frequent, perhaps 
several hundred per microsecond. For the same values 
of the parameters, we can also compare directly the 
physical collision rate, which is estimated as 4X10 14 col- 
lisions per second in a system of 32 molecules, with 
Alder and Wainwright’s [6] calculation rate, which is 
about two collisions per second. Thus we see that in 
either case, the calculational procedures are much 
slower than the actual molecular system. One should 
not conclude from these figures that the Monte Carlo 
procedure is more efficient than the Alder and Wain- 
wright procedure, since the root-mean-square displace- 
ment criterion is a very crude basis for comparison. 
Empirically, in terms of machine time, they seem to 
produce the interclass transitions at roughly the same 
rate. 

The most important task remaining is to establish the 
significance of the two branches of the p-V curve at 
r < 1.5, i.e., the connectivity and relative volumes of the 
corresponding regions of configuration space. Aside 
from possibly developing some more complicated sto- 
chastic approach, the chief hope of further progress lies 
in increasing the calculation speed. We have two ap- 
proaches in mind, one with respect to the method, the 
other involving the use of faster machines. 


A possible improvement in method lies in the question 
of how fine a subdivision of configuration space is re- 
quired in order for a digital calculation to give results 
practically equivalent to the continuum of states im- 
plied by the analytical integral in (1). As already 
mentioned, we currently use (2 17 ) 3 = 2 51 points in 3N 
dimensional space. If the number of bits required to 
represent the molecular coordinates can be reduced 
from the present 17 to perhaps 13-14 or less, then in the 
computation of the intermolecular distances we can re- 
place arithmetic multiplication by a faster table-look-up 
process. With this and other similar modifications we 
hope to gain a considerable factor, but certainly less 
than 10, of improvement in speed. 

The “Stretch” calculator currently under contract 
with IBM, for which coding is now in progress, will in- 
crease the calculation rate by a factor of 20-50, and we 
hope that the combination of improved programming 
and faster calculator will make possible a determination 
of the over-all average in the region near r = 1.55. In 
another direction in computer design, it is probably 
worthwhile to mention that this problem falls in the 
class of those for which parallel computation by several 
arithmetic units with common memory and control 
could result in a considerable increase in calculating 
rate. 

We shall close with a brief mention of some applica- 
tions in which the results have been interesting without 
being confusing. For the hard sphere system at r>1.6, 
where the chains are well convergent, the results are be- 
lieved to be essentially exact and have been of consider- 
able utility in calibrating various theoretical approxi- 
mations leading to analytical treatments. At sufficiently 
low densities the method gives results in agreement 
with the “virial expansion” whose first five coefficients 
are known [9]. On the other hand, at sufficiently high 
densities (t< 1.3) the lower of the two pressures calcu- 
lated from the two separate classes agrees increasingly 
well with the “free-volume” approximation. This sug- 
gests that the latter may be asymptotically correct at 
high densities. 

In the case of “Lennard- Jones molecules,” the method 
yields results [l] which applied to argon agree very well 
with experimental observations at pressures below 
about 4000 atmospheres, over a region where no ana- 
lytical approximation has given agreement. At higher 
pressures there is disagreement, with a suggestion that 
the experimental observations may be in error. If the 
disagreement is real it has interesting consequences in 
the study of intermolecular forces. Just as with hard 
spheres, there is agreement with the free-volume ap- 
proximation at very high densities. 

The method has also been tested successfully on the 
so-called “lattice gas” with nearest neighbor interac- 
tions, by comparison with known analytical results [4], 

We hope in the future to apply the method to systems 
of molecules of different kinds (mixtures) and to non- 
spherical molecules. 
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Real-Time Digital Analysis and Error- 
Compensating Techniques 

WALLY ITOf 


A Particular Example Illustrating the 
Analysis Technique 

T HE techniques which are described in this paper 
apply to the digital mechanization of any system 
represented as an ordinary linear differential 
equation with constant coefficients — inhomogeneous or 
homogeneous. I 

As an example, consider the sinusoidal loop as gen- 
erated by a DDA excited by a sampled external forcing 
function, /(/), as shown in Fig. 1. It is 

X + coo 2 # = /(/) ( 1 ) 

which can also be written as 



Fig. 1 — -DDA loop for aj+wo 2 x=f(t) with initial 
conditions x(0) and £(0). 


/ * t /% t 

x(r)dr = I f(j)dT. 
n An 


(2) 


However, reference to Fig. 1 reveals that the DDA 
lashup does not mechanize (2) but does mechanize 

t t t ti 

5 fo(r )dT ~ 5* (ovr+ “° j s s x D (r)dTdti 


S /. 


f(r)drdti (3) 


where ^ represents the integration operation used in 

o 

the computer. 

Let ^ represent the rectangular rule for integration, 

0 

and designate the Laplace transform of by q R (s) 

o 

f Res. Staff Engr. Aeronaut. Div., Minneapolis-Honeywell Reg- 
ulator Co., Minneapolis, Minn. 


= h / (l-e _sfe ) where h is the sampling interval in seconds. 
Taking the transform of (3), we obtain 


xd(s) 


*(Q) j*(Q) 7 

s 2 + oo 0 2 sq R s 2 + co 0 2 sq R s 2 + c o 0 2 sq R 


(4) 


where Xd(s ) is the Laplace transform of the digital solu- 
tion, XD(t). Consider then the inverse Laplace trans- 
forms: 


M D (t) = Z- 1 


1 

= L - 1 

" 1 

-S 2 + o >0 2 sq R _ 

sh 


s 2 + co 0 2 

L 1 — e~ sh 


and 


N D {t) = Zr 1 


\ * 1 

= L- 1 

s 

\_s 2 + coo 2 sq R _ 

sh 


s' 2 T co o 2 

1 — e~ ih _ 
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If we use the notation 
Md(s) = 


r 1 1 


1 

L^ 2 + coCsqR_ 


sk 


s 2 + coo 2 

1 — e~ sh 



(5) 


then we can write 


coo 2 /*-Mn 

M d + 


1 


5(1 — e sh ) s 2 


( 6 ) 


Multiplying both sides of ( 6 ) by the factor (1—e sh ) 2 , 
we obtain 

Cdo 2 hM D 1 

M d { 1 - e~ sh Y + — (1 - e - sh ) = — (1 - e ~ sh y. (7) 

s s 2 

Inverse Laplace transforming (7), we obtain 
M D (t ) - 2M D (t ~ k) + M D (t - 2k) 


+ co 0 2 /* f M D (r)dr = 0 (8) 

J t-h 

the integro-difference equation for Md(£). 

Imposing the mathematical constraint that, except at 
/ = 0, h, 2 h, • • • , kh, Md(I) is a constant staircase-type 
function, we see that the integro-difference equation 
(8) reduces to 

M D (t) - (2 - coo ?h)M D (t - h) - f M D (t - 2k) = 0 (9) 

a linear, homogeneous, difference equation with con- 
stant coefficients for the eigenfunction, Md(I). 

It turns out that 


M D (t) = 


sin ( 7 ) 


sin ( 7 /), 


( 10 ) 


where 


sin ( 7 /) = sin ( 7 kk) for kk < t < (k + 1 )/* 
k = 0 , 1 , 2 , • • • 


and 


7 = — arctan 

k 


' kcoo \/ 4 — coo 2 /* 2 ’ 

. 2 - coo 2 /* 2 J 


In a similar fashion, it can be shown that the other 
eigenfunction is given by 


N D (l) = L- 


r s 

-1 

_S 2 + COO 2 SQR- 


s 2 + coo 2 s<Zr_ 

1 — coo 2 h 2 — cos ( 7 )' 


= COS (yt) + 


sin ( 7 /) ( 11 ) 


sin ( 7 ) 

where cos (y£) = cos (ykh) for kk<t<(k-\-\)h 
k = 0 , 1 , 2 , • • • . 

Thus, we see that the digital mechanization of 

X + co 0 2 x = f(t) (1) 


by a DDA employing the rectangular rule yields a 
digital solution given by 

%]>(t) = x(0 )Md(I) H - x(0)N o(t) + f f(r)Mn(t — r)dr 

J 0 


k 

= i( 0 ) sin (yt) 

sin 7 


+ 


:r( 0 ) 


cos (yt) + 


( 


1 — coo 2 k 2 — cos y\ “I 

-J sin ( 7 /) I 


sin 7 


+ “7 f f(r) sin [y(t - r)]dr. 

sm 7 J 0 


( 12 ) 


is 


Methods of Compensating for Errors in the 
Digital Solution 

The exact solution for 

x + coo 2 # = f(t) 

x( 0 ) sin (co 0 1) 

x(t) = (- a;( 0 ) cos (co 0 /) 

co 0 


( 1 ) 


1 r 1 

H I f(r) sin [co 0 (/ - r)dr} 

COnt/ 0 


0)0 0 

as contrasted with the digital solution 

x(0)k -j- [l — coo 2 /* 2 — cos (yk)~\x(G) 

x D (t) = ; — 7 — sin (yt) 

sm (7 h) 

+ cr( 0 ) cos ( 7 /) 

+ — ■ - f f(r ) Sin [7(^ - r)]c/r. (13) 

sm (yh) J 0 

This comparison of exact and digital solutions suggests 
three successively applied techniques for compensation 
of digital truncation errors. 

The first and most obvious compensation we can 
introduce is to force 7 to equal coo. We do this by using a 
value coi 2 in the serial multiplier so that coi 2 satisfies the 
relationship 


coo = (1 /k) arctan 


/*C0i\/ 4 — COl 2 /* 2 ’ 


COl 


2 /* 2 


(14) 


The partially compensated digital solution then takes 
the form 


x D '(t) 


‘x(0)k + (1 — on 2 /* 2 — cos coi/*)x(0)' 


sin co 0 /* 


sin (coo t) 


+ x( 0 ) cos (co 0 /) 
k 


+ - f fij) sin [co 0 (/ - r)]dr. (15) 

sm (co 0 /*) J 0 

The solution xN(t) suggests that a well-chosen but in- 
correct value for x( 0 ) be used called ii( 0 ), so that 


xi(0)k T (1 — coi 2 /* 2 — cos coi/*)x(0) x(0) 


sin coo k 


( 16 ) 


coo 
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With these two digital compensations, the digital solu- 
tion looks like 


x(0) sin (coot) 

xd" ( t) = f- #(0) cos (coo t) 

COo 

hr 1 

H — — — — I f(r) sinco 0 (i t - r)dr. (17) 

sm (a) oh) J o 

The third and last compensation technique consists 
of multiplying/ by a constant, c — c{h ), so that 


c(h) 


1 


sin (oooli) coq 


(18) 


With the three compensations introduced, the digital 
solution is given by 

x(0) sin (cooO 

x D (t) = b x(0) cos (co 0 /) 


COQ 


1 r * 

H I /( r ) sin [co 0 (^ - r)]di 

COo J 0 


(19) 


as contrasted with the exact solution given by (13). The 
only errors left are those generated by the convolution 
integral 


1 n t 

— I /(r) sin [«„(< ~ T)]dr. 

con J 0 


+ x(0) e at cos (y t) + 


e~ ah _ cos (yhy 


he~ ah 

_j_ __ e 


sin (y h) 


sin (y h) 

Mt f f( T ) si n [y(t ~ T )]^ 7 

j 0 


e at sin (y/)| 

(20) 


where 


l r coo 2 AH 

Tk log V + —\ 


y = — arctan 
h 


' cooh 


V16 - coo 2 h 2 


] 


The exponential term, e a< , where a is positive as indi- 
cated, reveals that the truncation errors, incurred by 
use of the trapezoidal algorithm increase exponentially 
whereas there is no such tendency toward instability 
introduced by the rectangular algorithm. 

Another source of instability can be delays between 
successive integrations. For example, suppose we pur- 
posely delay one integration, say, one sampling period, 
behind the other integration. That is, suppose we mech- 
anize according to 

t t 

^ xj)(j)dT — ^ x(0)dr 
o o 

t l-h 1 

s s Xj)(T)drdt\ — s /: * f(j) dr dt (21) 


+ 0)q 


rather than by (3). 

Taking the transform of (21) and assuming the 
rectangular rule, we obtain 


Xd{s) 


x(0) 


sx (0) 


s 2 + co 0 2 (sf R )e~ 
f(s) 


s 2 + o) 0 2 (sf R )e 


+ 


COo J 0 

Instability Can Be Introduced by Wrong Choice 
of Algorithm or by Delay Between 
Integrations 

If the trapezoidal algorithm is used instead of the 
rectangular algorithm for integration in the digital 
mechanization of 

X + COo 2 # = f(t) 

the following expression is obtained as the digital solu- 
tion: 

r he~ ah “1 

x D (t) = x(0) - — — — e at sin (y t) 

Lsm (y h) J 


s 2 + c oo 2 (sf R )e hs 
The eigenfunctions corresponding to 

1 


s 2 + O) 0 2 (sf R )e 


and 


s 2 + oo 0 2 (sf R )e 


are 


exp 


dOge VI + O) 0 2 h 2 
- 


t X 


cos 


/arctan (hcoo) 
h 


t 


and 


'lOge VI + O) 0 2 /* 2 


h 


) r . /arctan (hco 0 ) \~ 

x s * n I ^ / ’ 


The eigenfunctions show that the effect of a one- 
sampling period delay between integrations is to intro- 
duce a positive exponential-type factor into the solu- 
tion — in other words, the solution is rendered inherently 
unstable. 
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Automatic Digital Matric Structural Analysis 

M. CHIRICO f, B. KLEIN f, and A. OWENSf 


List of Symbols 

A = cross-sectional area of bar. 

A max = maximum cross-sectional area of bar. 

A m in = minimum cross-sectional area of bar. 

A p = area of panel. 

-Bmax = length of longer parallel edge of tapered 
panel. 

-6 m in = length of shorter parallel edge of tapered 
panel. 

D x (i) — x component of displacement at the joint i 
in the coordinate system ( X , Y, Z). D y (i) 
and D z {i) are defined similarly. 

E — Young’s modulus. 

F x {i)—x component of the external force applied 
at the joint i. F y {i) and F z (i) are defined 
similarly. 

G = shear modulus. 

L{j = length of the bar with end joints i and j. 
Pij = load at the joint i on the bar B 
q = shear flow. 
q = average shear flow. 

8ij = the displacement of the joint i in the direc- 
tion of the bar 2Ly. 

A Xij — Xi — Xj. A yij and A are defined similarly. 
tn = thickness of panel at intersection of its 
diagonals. 
i,j, k , m, n = subscripts. 

Introduction 

T HE accelerated pace of computer development 
has made possible the rapid solution of complex 
problems. The time necessary to set up problems 
has begun to surpass greatly machine solution time. 
Consequently, more emphasis is needed on means of 
making use of digital machines and allied equipment 
in setting up problems automatically. 

Heretofore, the preparation of certain matric equa- 
tions appearing in structural analysis has been a tedious 
task. The procedures have required a large amount of 
judgment and tiresome hand computation. The chances 
for errors have been prevalent. 

The present paper presents a method whereby the 
above factors can be minimized or negated. Input data 
are reduced to a minimum. All logical decisions are 
carried out completely automatically so as to arrange 
the matrix automatically. Machine time is found to be 
very small relative to the time previously needed to set 
up problems. Therefore, this coded program should 
prove very useful to structures and allied engineers. 

f Convair Astronautics, San Diego, Calif. 


Some familiarization with Klein 1 - 2 is helpful but not 
necessary for the understanding of the development in 
this paper. The basic concepts are the ones of joint, bar, 
and panel. The joints connect the bars and the bars 
border the panels. 

Machine Simulation of Elements 

The code is built around the -basic elements: joints, 
bars, and panels. The information below noted by as- 
terisk (*) is computed. 

1) Joints 

A group of words is assigned to each joint contain- 
ing the following information: 

a) Its position coordinates x, y, and z 

b) Whether it is fixed 

*c) All the bars attached to the joint. 

2) Bars 

A group of words is assigned to each bar with the 
following data: 

a) Its two end joints 

b) Whether it is tapered 

c) Its cross-sectional area (both maximum and 
minimum, if tapered) 

*d) The direction cosines x, y, and z 
*e) The length of the bar La. 

3) Panels 

A group of words is assigned to each panel with 
information on: 
a) The four corner joints 
*b) The area 

c) The thickness 
*d) All of the bordering bars 
*e) B max and B min (lengths of the parallel sides) 
f) Whether it is tapered. 

The number of cells in a group must be multiples of 
eight to allow the use of multiple index registers. * ! A 
joint, bar, and panel, grouped together in 40 cells 'to 
economize on space, are not necessarily related. 

The Code 

A) Input 

The information without asterisks, described in 
the previous section, is read into the proper loca- 
tion. The numbers of the joints appearing in the 
bars and panels are converted into the two’s 
complement of that joint’s address. The joint then 
can be referred to with an index register. 

1 B. Klein, “A simple method of matric structures analysis,” 
J. Inst. Aeronaut. Sciences, vol. 24, pp. 40-46; January, 1957. 

2 B. Klein, “A simple method of metric structural analysis, part 
II — effects of taper and a consideration of curvature,” J. Inst. 
Aeronaut. Sciences, vol. 24, pp. 813-820; November, 1957. 
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B) Bar Cross Referencing 

1) The following is computed for each bar : 


a) Ax = %i 

Xj, 

(i) 

b) Ay = y t 

- y» 

(2) 

c) Az = Zi 

- Zj. 

(3) 


2) The bar address complement is stored in the 
groups of the joints i and j. 

3) Every panel is examined. If both the joints i 
and j are in it, the bar Bij borders it. In this 
case the two’s complement of the bar address 
is placed in the panel group and the two’s com- 
plement of the panel address is placed in the 
bar group. 

C) Joint Equations 

1) For each joint i the following force equilibrium 
equations are entered in the matrix 


a) If.,| 

i 

/ Ax \ 

\ L / 

| U = FJi ) , 

(4) 

b) E Pi, ( 

0 

(A y\ 

\ L / 

| = Fy(i), 

(5) 

c) E-p»( 

j 

( Az\ 

\ L / 

| = 

(6) 


D) Panel Area 

1) The two parallel bars for each panel are found 
by comparing the direction cosines. The bars are 
rearranged so that the longest parallel side is first 
and the other parallel bar is second. 

2) Twice the area A p is computed by the following 
(see Fig. 1). 

2 Ap h (-S m i n "J" -6m ax ) , W 

where 


h — La sin 0i (8) 

sin 0i = vT — cos 2 0i (9) 



The cosine above is equal to the dot product 
only if both sides are directed away from or to- 
ward the joint. If the sides are directed in op- 
posite ways, the dot product must be multi- 
plied by minus one. 

E) Panel Equation 

1) The shear panel displacement equation for the 
panel in Fig. 1 is 

(2 A j)/ Gtjd)Qjmki B max (8{j "T 8jm) ~j~ B miri 

— Lij(8ijA~dji) Lkm(8km-\-8mk) — 0 . ( 11 ) 



Fig. 1 — Shear panel. 


2) The arrows inside the panel indicate the direc- 
tion of shear flow. The arrows outside the 
panel indicate the direction of displacements. 

3) The corresponding panel and bar numbering 
for Fig. 1 is 

panel: P jmki 

bars: B mj , B ki , Bij, and B km . 

4) The panel numbering determines the positive 
direction of the shear flows, which are directed 
toward the joints designated by the first and 
third subscripts of the panel. (See Fig. 1.) The 
bar numbering determines the direction of 
positive displacement of the joint which is 
toward the joint determined by the first sub- 
script of the bar. 

5) The signs of the terms in the equation are: 

a) Shear term. This sign is minus. 

b) Displacement term. If the direction of 
positive displacement is the same as the 
direction of positive shear flow along a side, 
this sign is plus; if they are opposite, the 
sign is minus. 

6) The code determines the signs by examining 
the joint numbers. If the first bar subscript is 
equal to the first or third panel subscript, the 
sign is plus; otherwise, it is minus. 

7) The length of the side is the factor outside the 
parenthesis in the displacement term. Note 
that the length of the opposite side is used in 
the case of the parallel sides. 

F) Bar Equations 

1) The axial element equilibrium equation for 
each bar is 

Pn - Pi, + E w±j.) = o. (12) 

n 

2) When the bar Bij is one of the parallel bars 
bordering a panel, b n of (12) is the length of - 
the other parallel bar; otherwise, it is the 
length of the bar Bij. 

3) The sign of a shear term is positive if the posi- 
tive direction of the shear along an edge of the 
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panel and the positive direction of the dis- 
placement in the bar bordering that edge are 
opposite; otherwise, the sign of the shear term 
is negative. 

4) The axial force displacement equation for a 
tapered bar is 


Pij + Lij ^2 ( + q n )f(Bmin/B max? 


^4 max /A min) 


+ (EA/ Lij) (8ji - 8ij) = 0 (13) 


where Pa is the force at the narrow end of the 
bar. The sign of the shear term is plus if the 
shear and displacement directions oppose, and 


f(B n iin/7*max, -<4 max / A min) 


(A . / A ^0.175 

V^Mnm/ ^max; 

l + (5 min /3 max ) 0 - 7 


(14) 



10 * , 10 * 


Y 

Fig. 2 — Example problem. 


A — -4min + 


( A . I A \ 0.175 

^mim/ n max\ / 

2 ) 


(dmax -4m in ) (15) 


if the taper is linear. 

5) For a nontapered bar the equation is 

Pij + Pji + (2EA/Lij)(8ji - 8ij) = 0. 
G) The Flow Chart follows. 


(16) 



Example Problem 

The structure in Fig. 2 is analyzed by the code. The 
complete input data describing the structure appears 
in the next section, which is followed by the matrix 
generated and the solution with the matrix column 
numbers. The elements without matrix column numbers 
are obtained by a simple calculation from matrix com- 
puted elements; e.g., Delta 1-7 equals Delta 1-3. 

Total computing time, i.e., the time for both matric 
setup and solution, is about 0.01 hour. 

Problems of a much more formidable nature have 
been arranged and solved by the program, and work is 
in progress for improving the code. For example, use of 
instantaneous coordinates may create many more zero 
elements in the matrix. 



JOINT 

3 

9 

X 

0.0 

0.0 

¥ 

1000.0 

1000.0 

Z 

0.0 

0.0 


JOINTS 

NUMBER 

X 

Y 

Z FIXED 


1 

- 2.5 

16.0 

0.1 

NO 


2 

- 3.75 

8.0 

0.1 

NO 


3 

- 7.5 

16.0 

0.1 

NO 


4 

-11.25 

8.0 

0.1 

NO 


5 

- 5.0 

0.0 

0.1 

YES 


6 

-15.0 

0.0 

0.1 

YES 


7 

2.5 

16.0 

0.1 

NO 


8 

3.75 

8.0 

0.1 

NO 


9 

7.5 

16.0 

0.1 

NO 


10 

11.25 

8.0 

0.1 

NO 


11 

5.0 

0.0 

0.1 

YES 


12 

15.0 

0.0 

0.1 

YES 

BARS 

JOINT 

1 JOINT 2 

TAPERED 

Am ax 

Amin 

E 

3 

4 

YES 

1.0 

0.66667 

1.0 

4 

6 

YES 

1.3333 

1.0 

1.0 

9 

10 

YES 

1.0 

0.66667 

1.0 

10 

12 

YES 

1.3333 

1.0 

1.0 

1 

2 

YES 

1.0 

0.66667 

1.0 

2 

5 

YES 

1.3333 

1.0 

1.0 

7 

8 

YES 

1.0 

0.66667 

1.0 

8 

11 

YES 

1.3333 

1.0 

1.0 

3 

1 

NO 

0.66667 

0.66667 

1.0 

4 

2 

NO 

1.0 

1.0 

1.0 

9 

7 

NO 

0.66667 

0.66667 

1.0 

10 

8 

NO 

1.0 

1.0 

1.0 

1 

7 

NO 

0.66667 

0.66667 

1.0 

2 

8 

NO 

1.0 

1.0 

1.0 
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PANELS 


ber' J° int 

JOINT JOINT 

J° INT SHEAR 

THICK- 

NESS 




1 3 

4 2 

1 0.4 

0.07 




2 4 

6 5 

2 0.4 

0.1 




3 1 

2 8 

7 0.5 

0.07 




4 2 

5 11 

8 0.4 

0.1 




5 7 

8 10 

9 0.4 

0.07 




6 8 

11 12 

10 0.4 

0.1 




MATRIX 







ROW 

COL 

VALUE 


ROW 

COL 

VALUE 

1 

1 

0.42443388 


20 

19 

-7.5 

1 

2 

-0.42443388 


20 

37 

-5.0 

1 

3 

1.0 


20 

43 

-5.0 

2 

1 

-0.90545894 


21 

38 

5.0 

2 

2 

0.90545894 


21 

35 

-1.0 

3 

4 

-1.0 


21 

10 

-1.0 

3 

5 

0.90545894 


21 

23 

1.0 

3 

6 

-0.42443388 


22 

30 

-0.26666667 

4 

7 

0.15437688 


22 

37 

-0.26666667 

4 

8 

-0.15437688 


22 

10 

1.0 

4 

9 

-1.0 


22 

23 

1.0 

4 

10 

1.0 


23 

38 

-7.5 

5 

7 

-0.98801203 


23 

13 

-1.0 

5 

8 

0.98801203 


23 

26 

1.0 

6 

11 

-0.15437688 


24 

34 

-0.26666667 

6 

12 

-1.0 


24 

40 

-0.26666667 

6 

13 

1.0 


24 

13 

1.0 

7 

11 

0.98801203 


24 

26 

1.0 

8 

14 

-0.42443388 


25 

32 

-5.0 

8 

15 

0.42443388 


25 

27 

10.0 

8 

16 

-1.0 


25 

9 

-1.0 

9 

14 

-0.90545894 


25 

3 

1.0 

9 

15 

0.90545894 


26 

30 

0.26666667 

10 

17 

-1.0 


26 

31 

-0.26666667 

10 

18 

0.90545894 


26 

9 

1.0 

10 

19 

0.42443388 


26 

3 

1.0 

11 

20 

-0.15437688 


27 

32 

7.5 

11 

21 

0.15437688 


27 

12 

-1.0 

11 

22 

1.0 


28 

34 

0.26666667 

11 

23 

-1.0 


28 

6 

-0.26666667 

12 

20 

-0.98801203 


28 

12 

1.0 

12 

21 

0.98801203 


29 

44 

5.0 

13 

24 

0.15437688 


29 

41 

-10.0 

13 

25 

1.0 


29 

22 

-1.0 

13 

26 

-1.0 


29 

16 

1.0 

14 

24 

0.98801203 


30 

37 

0.26666667 

15 

27 

-3499.9999 


30 

43 

-0.26666667 

15 

28 

-8.8352984 


30 

22 

1.0 

15 

29 

8.0970674 


30 

16 

1.0 

15 

30 

-10.0 


31 

44 

-7.5 

15 

31 

-10.0 


31 

25 

-1.0 

16 

32 

-3571.4285 


32 

40 

0.26666667 

16 

28 

-8.8352984 


32 

19 

0.26666667 

16 

4 

-8.8352984 


32 

25 

1.0 

16 

29 

8.0970674 


33 

35 

8.0970674 

16 

33 

-8.0970674 


33 

27 

-8.0970674 

16 

34 

-7.5 


33 

45 

-1.0 

16 

6 

-7.5 


33 

8 

1.0 

16 

30 

5.0 


34 

35 

4.2360996 

16 

31 

5.0 


34 

27 

-4.2360996 

17 

35 

-3499.9999 


34 

29 

-0.1430726 

17 

36 

8.0970674 


34 

8 

1.0 

17 

29 

-8.0970674 


35 

38 

8.0970674 

17 

30 

-1.0 


35 

32 

-8.0970674 

17 

37 

1.0 


35 

7 

-1.0 

18 

38 

-3571.4285 


35 

11 

1.0 

18 

36 

8.0970674 


36 

38 

4.3028433 

18 

39 

8.0970674 


36 

32 

-4.3028433 

18 

29 

-8.0970674 


36 

33 

-0.10150823 

18 

33 

-8.0970674 


36 

29 

0.10150823 

18 

34 

-7.5 


36 

11 

1.0 

18 

40 

7.5 


37 

41 

8.0970674 

18 

30 

5.0 


37 

35 

-8.0970674 

18 

37 

-5.0 


37 

46 

-1.0 

19 

41 

-3499.9999 


37 

21 

1.0 

19 

42 

8.8352984 


38 

41 

4.2360996 

19 

36 

-8.0970674 


38 

35 

-4.2360996 

19 

37 

10.0 


38 

36 

-0.14307260 

19 

43 

10.0 


38 

21 

1.0 

20 

44 

-3571.4285 


39 

44 

8.0970674 

20 

42 

8.8352984 


39 

38 

-8.0970674 

20 

17 

8.8352984 


39 

20 

-1.0 

20 

36 

-8.0970674 


39 

24 

1.0 

20 

39 

-8.0970674 


40 

44 

4.3028433 


20 40 7.5 40 38 - 4.30284 33 
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ROW 


COL 



VALUE 

ROW 

COL 



VALUE 

40 


39 



-0.10150823 

46 

41 



-4.6223159 

40 


36 



0.10150823 

46 

42 



-0.13111821 

40 


24 



1.0 

46 

15 



1.0 

41 


27 



8.8352984 

47 

44 



-8.8352984 

41 


47 



-1.0 

47 

14 



-1.0 

41 


2 



1.0 

48 

44 



-4.6951448 

42 


27 



4.6223159 

48 

17 



-0.093026737 

42 


28 



-0.13111821 

48 

42 



0.093026737 

42 


2 



1.0 

27 

49 



-468.75001 

43 


32 



8.8352984 

28 

49 



-468.75001 

43 


1 



-1.0 

31 

49 



-468.75001 

44 


32 



4.6951448 

32 

49 



-468.75001 

44 


4 



-0.903026737 

43 

49 



-1104.4123 

44 


28 



0.093026737 

44 

49 



-1104.4123 

45 


41 



-8.8352984 

47 

49 



-1104.4123 

45 


48 



-1.0 

48 

49 



-1104.4123 

45 


15 



1.0 
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MATRIX 






MATRIX 





COLUMN 


ELEMENT 

VALUE 
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ELEMENT 

VALUE 

NUMBER 






NUMBER 





24 

P 


1 

2 

0.0 

35 

Q 

4 


0.0 

25 

P 


1 

3 

1631.4448 

32 

Q 

5 


-407.4740 

26 

P 


1 

7 

1631.4448 

27 

Q 

6 


-105.1129 

20 

P 


2 

1 

3299.3447 

39 

DELTA 

1 

2 

43445.283 

22 

P 


2 

4 

986.2406 

40 

DELTA 

1 

3 

6117.8879 

21 

P 


2 

5 

3299.3447 


DELTA 

1 

7 

6117.8879 

23 

P 


2 

8 

986.2406 

36 

DELTA 

2 

1 

26172.822 


P 


3 

1 

418.5939 

37 

DELTA 

2 

4 

3698.4027 


P 


3 

4 

986.2406 


DELTA 

2 

8 

3898.4027 

16 

P 


4 

2 

0.0 


DELTA 

2 

5 

26172.822 

14 

P 


4 

3 

7443.9683 


DELTA 

3 

1 

29813.814 

15 

P 


4 

6 

7443.9683 

17 

DELTA 

3 

4 

151221.70 

46 

P 


5 

2 

4150.4514 

43 

DELTA 

4 

2 

7396.8052 

48 

P 


6 

4 

6515.2640 

42 

DELTA 

4 

3 

53067.405 

13 

P 


7 

1 

1631.4448 


DELTA 

4 

6 

53067.405 

11 

P 


7 

8 
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DELTA 

7 

1 
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12 

P 


7 

9 
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33 

DELTA 

7 

8 
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10 

P 


8 

2 

986.2406 

34 

DELTA 

7 

9 
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7 

P 


8 

7 
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DELTA 

8 

2 

3698.4027 


P 


8 

10 

986.2406 

29 

DELTA 

8 

7 
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8 

P 


8 

11 
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30 

DELTA 

8 

10 
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9 

7 
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8 

11 
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9 
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9 

10 
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9 
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8 
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10 

8 
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28 
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10 

9 
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45 
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11 

8 

4150.4514 
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10 

12 
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47 
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12 

10 

6515.2640 
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DX 
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1 


407.4740 

18 

DY 

3 
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Q 


2 
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6 

DX 
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38 

Q 


3 


0.0 

5 

DY 

9 
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A New Approach to High-Speed Logic 

W. D. ROWEf 


Introduction 

M OST approaches to the construction of high- 
speed computers and logic systems in the past 
have been first, to go from serial to parallel 
circuitry, and then to use faster and faster components 
to increase the operational speed of the circuitry. High- 
speed components have reached the state where we are 
not talking about switching times in the order of the 
light-foot. That is, if we have a switching circuit with a 
switching time of one millimicrosecond, the theoretical 
limit in which we can switch this information from one 
point to another requires that the separation distance 
be less than one foot. This is the theoretical limit 
imposed by the velocity of light and shows that we are 
rapidly approaching the limits of switching speed. 

It is therefore necessary that we look again at the or- 
ganization and utilization of logic techniques to deter- 
mine whether there are not other means of obtaining 
high-speed operation from logic circuitry, and thereby 
of bypassing the need for extremely fast components. 

It has long been recognized 1 that one solution to this 
problem would be the use of canonical logic forms of 
either the minterm (sum of products) or maxterm (prod- 
uct of sums) form. 2 The practical application of this 
technique has long been hampered by the lack of a suit- 
able electronic device capable of responding to the 
multiple input-output loadings required. This paper 
discusses a transistor device, the “Modified NOR Cir- 
cuit,” which is capable of handling up to 25 inputs and 
25 outputs with response times of about 80 m/xsec. This 
device is applied to the design of a high-speed adder and 
a high-speed counter. 

The design approach used here is termed “parallel- 
parallel” logic. This term arises from the fact that not 
only is the function constructed in parallel, but the logic 
is also constructed in parallel. A comparison of this logic 
with others is as follows : 

Serial Logic 

F unction — Serial 
Logic — Serial 
Parallel Logic 

F unction — Parallel 
Logic — Serial / 

Parallel-Parallel Logic 
F unction — Parallel 
Logic — Parallel 

f Westinghouse Electric Corp., Buffalo, N. Y. 

1 R. K. Richards, “Arithmetic Operations in Digital Computers,” 
D. Van Nostrand Co., Inc., New York, N. Y. ; 1955. 

2 M. Phister, Jr., “Logical Design of Digital Computers,” John 
Wiley and Sons, Inc., New York, N. Y. ; 1958. 


Fundamentals of Parallel- Parallel Logic 

With a logic circuit that has an infinite number of in- 
puts and outputs and if all input signals and their com- 
plements are available, logic arrays can be constructed 
using only one level of logic circuits. The advantage of 
this lies in the fact that the complete operation time of a 
logic array consists of only one logic circuit propagation 
time. This means that the maximum speed of a logic 
array is the same as the maximum operating speed of a 
single logic component. 

To explain this procedure, it can be shown that any 
logical operation can be written using conventional 
Boolean notations (where +indicates an OR operation, 
and • represents an AND operation, and — a comple- 
ment) as a sum of products or a product of sums, i.e., 

y = Xi'X 2 + Xi- x 2 — (#i + x 2 )(xi + x 2 ). (1) 

The left-hand expression in the equation may be rep- 
resented by a number of AND circuits (first level) work- 
ing into a single OR gate (second level) (Fig. 1), while 
the right-hand (equivalent) expression can be repre- 
sented by a number of OR gates (first level) working 
into an AND gate (second level) (Fig. 2). It is then ob- 
vious that if each logic circuit has no limit in its fan-in 
and fan-out, all logic can be done on two levels or a 
depth of two. Now, the OR operation is electronically 
unique, and can often be performed by a simple junc- 
tion of leads without any logic elements. In the case of 
the NOR logic element, to be discussed below, the OR 
operation need not be performed at all, since this ele- 
ment accepts multiple inputs and performs the neces- 
sary OR operation implicitly. 

If all signal complements are not available, a third 
level is required for negation. Since most signals come 
from bistable registers in parallel-parallel operation, 
both the signal and its complement are almost always 
available. The insertion of a third level in some inputs 
and not others can cause occurrence of certain race con- 
ditions if care is not exercised. Memory and counting 
circuits also require special consideration. 

Previous papers have discussed the use of a par- 
ticular universal logic circuit called a “NOR” circuit. 3 ’ 4 
The logic of this circuit is such that an output exists if, 
and only if, neither input A NOR, B NOR, C NOR, 
etc., is present. (See Fig. 3.) This circuit is universal in 
that it can perform all logic functions when combined 
in various forms with other NOR circuits. The right- 

3 W. D. Rowe, “The transistor NOR circuit,” 1957 WESCON 
Convention Record, pt. 4, pp. 231-245. 

4 W. D. Rowe and T. A. Jeeves, “The NORDIC II computer,” 
1957 WESCON Convention Record, pt. 4, pp. 85-95. 
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Fig. 1 — Representation of two-level logic — AND-NOR. 


A o 


B o 


-oC 


(a) 



(b) 

Fig. 3 — NOR logic; (a) diagrammatic NOR; (b) truth table. Logical 
expression: an output appears at C if neither input A nor input 
B is present. 



Fig. 2 — Representation of two- level logic — OR-AND. 



LEVEL 1 


LEVEL 2 


y - (*i + * 2 )(£i + * 2 ) 

Fig. 4 — Representation of two-level logic — NOR-NOR. 


hand expression of (1) can be expressed simply by opera- 
tion of several NOR circuits (first level) into another 
NOR circuit (second level). (See Fig. 4.) (Where the out- 
put of the array is fed to other similar circuits the second 
level may be omitted, as remarked above.) 

The particular advantage of using the NOR circuit 
instead of English logic circuits is twofold. First, only 
one type of circuit is required, so that only a single prop- 
agation constant is necessary; and second, it is possible 
to build NOR circuits with numbers of inputs and out- 
puts that are quite large, in the order of 25 inputs and 
25 outputs. This number is sufficiently large for many 
applications. 

Application of Parallel- Parallel Technique 
to a Full Adder 

The examples used so far are trivial in that they are 
two levels for both the parallel and the parallel-parallel 
cases. In order to illustrate more fully the advantages of 
the parallel-parallel method, a slightly more sophisti- 
cated example will be examined. 

This case is that of a full adder circuit. A basic full 
adder (Fig. 5) provides the sum of two addend variables 
and the carry variable from the preceding stage in 
binary form. The circuit is built up of two half-adders 


utilizing a maximum of two inputs per logic circuit. The 
depth is five, so that five propagation times are re- 
quired. 

The logic expression for this circuit is shown to be 

s = [xy + xy + c] [xy + xy + c], (2) 

which is derived directly by the substitution of the out- 
put of one-half adder which can be expressed in two 
equivalent ways: 

H = xy p xy (x + y){x -p y) (3) 

into the input of the second half adder, expressed as 

S=(H+C)(H+C) (4) 

to form (2). Here x and y are the two addend variables 
and C the preceding carry variable. 

Manipulation of this equation brings it into parallel- 
parallel form so that the equivalent expression is 

s = (x -f- y + c)(x + y + c) {x + y + c)(x + y + c). (5) 

The derivation of this equation from (2) is shown in 
Appendix I. 

This equation leads to the circuit of Fig. 6, where 
multi-input logic circuits are used. The depth of logic 
is now only two. 



Rowe: A New Approach to High-Speed Logic 


279 



■[0 + y){x + y) + c)[{x + y)(x + y) + c] = s 

Fig. 5 — -Full adder consisting of two half -adders, using 
only two inputs per NOR. Depth = 5. 



5 = (x + y + c)(x + y + c)(x + y + c)(x + y + c) 
Fig. 6 — Full adder consisting of parallel-parallel logic. 


In the example, fewer logic circuits are required in 
the parallel-parallel case, since most of the logic is ac- 
complished in inputs of the logic circuits and the inter- 
wiring. This does not always hold true. The worst case 
of parallel-parallel logic has a maximum of 2 n — 1 first- 
level logic circuits, where n is the number of inputs to 
each first-level logic circuit. There are q second-level 
logic circuits, where q is the number of desired outputs. 
Fortunately, most applications are so specialized that 
only a few first-level logic circuits are required. How- 
ever, in the worst case, the second-level logic circuits 
must handle 2 n — 1 inputs. 

It is possible to reduce the number of logic circuits by 
compromising on depth, as was done by Weinberger 
and Smith. 5 

5 A Weinberger and J. L. Smith, “A one-microsecond adder using 
one megacycle circuitry,” IRE 'Trans, on Electronic Computers, 
vol. EC-5, pp. 65-73; June, 19.56. 
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The Modified Transistor NOR Circuit 

In order to facilitate the use of parallel logic it is nec- 
essary to have a circuit with as large a number of inputs 
(fan-in) and outputs (fan-out) as possible. Since it is ex- 
tremely desirable to use only a single logic circuit, the 
transistor NOR circuit was selected for investigation. 

The basic transistor NOR circuit is shown in Fig. 7. 
A negative voltage on any of the inputs, M, is sufficient 
to cause the transistor to saturate and supply a ground 
potential signal to the outputs, N. The absence of a 
negative voltage on the inputs causes the positive bias 
voltage to maintain the transistor at cut-off. Under this 
condition, the transistor being in a very high impedance 
state, the outputs see a negative voltage equal to 

Ri 

output voltage = V cc > 

xR c + Rr 

where Ri is the input resistor value of a NOR circuit 
being driven by one of the NOR circuit outputs, N; R c 
is the collector load resistor; and x is the number of out- 
puts, N, actually connected to the inputs of other NOR 
circuits. Only one NOR circuit input is driven by one 
NOR circuit output. This equation shows that the out- 
put voltage is reduced as more inputs of succeeding cir- 
cuits are connected to the outputs of the NOR circuit in 
question. 

If the minimum voltage that appears under the fully 
loaded condition is called a “one” (this voltage being 
sufficient to saturate a succeeding NOR), and the ab- 
sence of this voltage (ground) is called a “zero,” the 
logic conditions of Fig. 3 are fulfilled and this is the basic 
NOR operation. 

Since all inputs are resistive, and most of the logic 
is accomplished in the input network, only a small num- 
ber of transistors are required as compared to the num- 
ber of resistors. This is certainly desirable, as most of the 
logic is now accomplished by the wiring interconnec- 
tions and resistors, which can be inexpensive, reliable, 
and miniaturized elements. 

For extreme reliability, the basic NOR circuit seems 
to have a maximum fan-in, fan-out of six, which is 
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Fig. 8 — The modified transistor NOR circuit — type A. 


fairly small when one is considering parallel-parallel 
logic. In order to increase the fan-in and fan-out to 
about 25, a more suitable number, the basic NOR cir- 
cuit has been modified so that the output voltage re- 
mains at essentially power-supply potential regardless 
of output loading. This allows the circuit to drive many 
more outputs than is possible in the basic circuit. 

One means of modifying the NOR circuit is shown in 
Fig. 8. A 24-volt breakdown Zener diode is placed across 
the collector and emitter of the transistor such that out- 
put will never fall below 24 volts when the transistor is 
cut off (output load is restricted to never exceed a condi- 
tion where the output voltage will fall below this value). 
Under this condition the current derived at the high 
voltage V cc acting through the resistor is shunted be- 
tween the load and the Zener diode. As the load varies 
(due to changes in the number of outputs used) the ex- 
cess current is shunted through the diode so that the 
load is continually driven from a 24- volt output signal, 
during a “one” output condition. 

It is certainly possible to replace the diode with a re- 
sistor whose value is chosen such that output voltage is 
always in the fully loaded condition. This reduces the 
circuit flexibility since a different resistor value is re- 
quired for every different condition of output loading. 

A basic disadvantage of this circuit is its high power 
dissipation, arising from the high-voltage power supply 
acting across resistor; R c . This disadvantage has been 
eliminated by the design of a second type of modified 
NOR circuit. 

This circuit is based on the fact that under certain 
conditions a transistor makes an excellent constant 
voltage source. This operation is described in Fig. 9. A 
constant emitter current source is derived by the con- 
stant voltage difference between bias voltages ( V ee and 
Vce ) acting across emitter resistor Re. This contains 
the maximum collector current to be 

I — V ee I — I — Vce | 


where | — Vee\ > | — Vce\ and A is the current gain of 




Fig. 9— (a) Constant voltage circuit, (b) circuit operation plot. 

the transistor. Below this value a range of constant po- 
tential exists. An actual plot of this can be made by ob- 
serving the meters in the circuit of Fig. 9(a) when the 
load resistor Re is varied. This plot is shown in Fig. 9(b) 
for a particular case where I Cmax = 20 ma and V C e — — 24 
volts. The constant voltage portion is shown, and this is 
the operating range utilized. 

This circuit is then combined with the NOR circuit to 
make a very effective modified NOR circuit. (See Fig. 
10.) Transistor 7\ is the basic logic switch, and tran- 
sistor T 2 is the constant voltage source. The power dis- 
sipation of this circuit is considerably less than that of 
Type A circuit. 

The silicon diode placed in the forward direction be- 
tween base and ground prevents the base from being 
overloaded when a plurality of inputs have “one” sig- 
nals applied. This connection makes use of the high for- 
ward-voltage drop of most silicon diodes. This circuit 
has actually been constructed using micro-alloy diffused 
base transistors. The circuit has a simultaneous fan-in 
and fan-out of 25 with an average propagation time in 
the order of 80 m/xsec. 

In actually testing circuits in various applications, 
evidence points out that the average propagation time 
is a more significant measurement of operation speed 
than rise, fall, and storage' times considered individu- 
ally. Average propagation time is measured by taking a 
series string of n logic circuits and applying a pulse to 
the first stage. The average propagation time is the time 
delay for the pulse to be propagated from the first to the 
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last stage divided by n, the number of logic circuits. In 
any particular NOR circuit the actual propagation time 
varies only a few per cent from the average, in the ma- 
jority of cases. Since rise, fall, and delay times do not 
add directly to give actual operating time of com- 
pounded circuits, average propagation seems to be a 
more significant means of circuit operation time meas- 
urement. 

Application of Parallel- Parallel Logic to 
High-Speed Addition 

In order to illustrate the effectiveness of parallel- 
parallel logic, an application consisting of the circuitry 
for the carry operation of a high-speed adder will be 
shown in detail. 

One of the major difficulties in designing high-speed 
adding devices is the drawback of ripple-through 
carry. 1 When making a binary addition, the most sig- 
nificant bit is dependent on the carry signal from the 
preceding stage, which is in turn dependent on the carry 
signal of its preceding stage, etc. This means that the 
most significant bit is dependent on the condition of the 
least significant and every other bit in order. 

It has been shown 1 ’ 5 that the expression for a carry 
signal from any particular stage k of a binary adder may 
be given as 

Ck = Ak-Bk + ( Ak + Bk)Ck- i, ( 7 ) 

where Ck is the carry signal from stage K, Ck - i is the 
carry signal from the preceding stage, and A K and B K 
are the addends associated with bit K. If the substitu- 
tions 

D k = Ak-Br 

Rk — Ak + Bk (8) 

are used for convenience, the carry signals for binary 
adder of n bits may be expanded. Then 

Co = Co. (9) 

Generally, since there is no carry into the least signifi- 
cant stage, Co is 0. 



Fig. 1 1 — Three-stage high-speed carry circuit. 


Cl = Dx + RiCo = D 1 ( 10 ) 

C 2 = D 2 + RzDx + R2R1C0 ( 11 ) 

C3 = Dz + RzDz + RzRiDi + RzR%R]C $ (12) 

C n = D n + R n D n - 1 + RnRn-lDn -2 + * * * 

+ RnRn—l * ' ' R2R1C 0. 

This shows that the carry for any bit n is immediately 
available from a single logic array for each bit. Further- 
more, some of the logic expressions that are used in de- 
termining the carry of a particular bit are used in all the 
succeeding bits, so that much of the circuitry of each 
bit is necessarily repeated throughout. Actual circuitry 
for a three-bit carry circuit is shown in Fig. 11. The 
carry for each bit is arrived at after only two levels of 
logic circuitry, regardless of the bit position. Also, the 
adder of Fig. 6 may be used as the adder circuit, with 
the negation of the carry derived directly from similar 
circuitry. Total addition time is then exactly four prop- 
agation times or approximately 320 m/nsec, regardless 
of the number of bits in the adder. 

For the basic carry circuit the number of logic cir- 
cuits ( L ) is 

L = E (X + 2) = - = (l/2)« 2 + (5/2 )», (14) 

x=i 2 

where n is the number of bits. The maximum number 
of inputs required by any logic circuit is ra + 1, and the 
maximum number of outputs is max k(n — £ + 1), which 
is n(n-\- 2)/4 if n is even, and (« + l) 2 /4 if n is odd, where 
k is bounded by 1 <k<n. Then for the addition of two 
20-bit words (without sign) a logic circuit with a fan-in 
of 21 and a fan-out of 110 is required. (The fan-out can 
be kept below 25 by using a logical design other than 
that given in Fig. 11. See Appendix II.) The carry cir- 
cuit will require 250 NOR circuits. This compares with 
80 logic circuits (fan-in fan-out of 3) for a particular 
twenty-bit ripple carry circuit in use in the NORDIC 
II computer (4 circuits per bit). 4 
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Fig. 12 — Reversible binary counter. 


Eq. (14) shows that by compromising on depth and 
thereby speed, the 20-bit carry circuit can be broken 
down into two 10-bit carry circuits, each requiring 75 
logic circuits with a fan-in of 11 and a fan-out of 30. 
However, the depth is now four, half the speed of the 
20-bit carry case. For a depth of eight, a total of 100 
logic circuits are required, since four 5-bit carry circuits 
may be used. Only 6 inputs and 9 outputs are required 
for each level. This demonstrates the flexibility that is 
available by compromising on speed. An example of 
what can be done with only a limited fan-in and fan-out 
is the SEAC high-speed adder. 5 

Application of Parallel- Parallel Logic to 
Counting Circuits 

Some circuits, such as counter chains, do not fall 
directly into easy application of parallel-parallel logic. 
It is possible to treat such operation in a similar manner 
by making all counters and similar devices operated by 
parallel logical means, instead of direct sequential 
means. To illustrate this point, an ordinary binary 
counter has been examined to determine the logic in- 
volved in switching each stage by a count pulse. Then 
by considering each counter stage as a logical input, and 
determining from this the necessary changes in each 
counter stage for the next count pulse, we can effect a 
complete count cycle in only two propagation times, 
plus the switching time of a counter. 

An example of such a binary counter is shown in Fig. 
12 for 4 bits. It is reversible in that it can count up or 
down. The condition of the counter before a count 
pulse determines which counters should be changed 
when the count pulse appears. When all the counters 
have been changed, the new condition determines the 
change for the next succeeding count pulse, etc. 

The limit of the size of a binary counter of this nature 
depends upon the maximum allowable fan-in, fan-out 
of the NOR circuits used. Thus for a 25-input-output 
NOR circuit, a 24-bit, simultaneous advance, binary 
counter is possible. It is also obvious that the counter 
code used is restricted only by the logic used. Therefore, 
a counter of any desired code is possible. 

Racing and Timing Conditions 

When every input signal is required to travel in paths 
of equal length to the output (i.e., all inputs extend 
through the same depth), the only conditions of racing 
that can occur are due to the variation of the propaga- 


tion time of any logic circuit from the average propaga- 
tion time. This condition is easily bypassed since 
parallel-parallel operation is exactly synchronous. This 
means that all output signals appear simultaneously 
after £ predetermined time delay, when input conditions 
are changed. This time delay (taken for the deepest 
depth used) sets the minimum timq between operations. 
False conditions due to racing occur only within this 
time period and disappear at the termination. All racing 
of this type is easily suppressed by gating the outputs 
so they do not read out during the circuit propagation 
time. 

Conclusions 

Parallel-parallel logic operation allows circuit opera- 
tion at very high speed without the use of much higher 
speed components. The sacrifice for obtaining this op- 
eration is the number of logic circuits required and the 
topological and geometrical problems that arise from 
the complications of interwiring. Fortunately, these 
problems are not unduly severe for most actual applica- 
tions. 

The use of the modified NOR circuit allows operation 
with only one type of logic circuit with a large fan-in 
and fan-out. Furthermore, the logic is accomplished in 
the resistor gating and interwiring most of the time. 
This means that the actual number of transistors used 
is small compared to the amount of logic accomplished. 

The attempt in this paper has been to outline briefly 
some of the possibilities that lie in this type of logic, 
along with its limitations. Although considerable work 
has been done in this area, it is too lengthy and detailed 
for inclusion here. Table I is a summary and comparison 
of the different types of logic and their applications, and 
simply expresses the basic ideas involved. 


TABLE I 


Serial 

Parallel 

Parallel- Parallel 

Logic and function by 
sequence 

Function is parallel 
Logic is sequence 

Function is parallel 
Logic is parallel 

Slow 

Fast 

Very fast 

Minimum equipment 
redundancy 

High equipment 
redundancy 

High Equipment 
redundancy 

Appendix I 


Logic Manipulation of the Full Adder 

Half sum = xy + xy — H. (15) 

Full sum = HC + HC = S, (16) 

where C is the carry from the preceding stage. 

H = (x -j- y) {x + y) = xx + xy + yy + xy — xy + xy (17) 
since xx = yy = 0. 

5 = (H+CXH+C). (18) 

By similar methods, 

5 = \xy-\- xy + C\[xy + xy + c\. 


(19) 
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By^substituting (15) into (19), 

s — [(xy)(xy) + c\[xy + xy + c\. (20) 

Since A +B = A • B, 

s = [(£ + y)(x -f y) + c][xy -f xy + c]. (21) 

Since A ■ B = A + B, 

s = [xy + xy + c\[xy -fi- xy + c\, (22) 

which gives 

^ = xyc + xyc + xyc + xyc, (23) 

since xx = yy — 0. 


This is in AND-OR form. To get the more desirable 
(for NOR logic) OR-AND form, 

s = (x + y + c) + (x + y + c) + (x + y + c) 

+ (x +y + c). (24) 

Since A B — A + B, 

s = (« + y + c){x + y + c)(x + y + c)(x + y + c), (25) 
S = (xy -(- xc T xy -T yc T xc -f - yc T c) 

(xy + xc + xy + yc + xc + yc + c). (26) 

Since A- A — A, 

s = xyc + xyc + xyc + xyc\ (27) 

since A -j-A — A, 

s = (xyc) (xyc) (xyc) (xyc) ; (28) 

since A -\-B —A B, 

s = (x + y + c)(x + y + c)(x + y + c)(x + y + c). (29) 
Eq. (29) is the final form. 



Fig. 13 — Three-stage high-speed carry circuit, limited loading case. 


Appendix II 

In order to constrain the number of outputs to a 
single circuit for such a matrix as the simultaneous carry 
circuit to the limit of »+/, more circuits can be used to 
parallel the loading. This is indicated in Fig. 13 which 
shows the high speed for three bits with a maximum 
of nArl inputs and outputs on each logic circuit. 

The number of circuits required is now 

L = n(n + 2) = ri 1 + In, 

which is certainly much larger than the previous case. 
This shows that an unlimited fan-out is desirable. For 
limited cases, this may be compromised by using parallel 
logic circuits. In either circumstance, the depth of levels 
remains unchanged. 

Acknowledgment 

The author is grateful for the efforts of Dr. T. A. 
Jeeves who initially suggested this approach and who 
has contributed to much of its development. 


Information Retrieval Study* 

ROBERT COCHRANf 


AS is well-known in the computer field, the basic 
/—\ scheme of information retrieval is to search a 
given set of data and extract those records which 
satisfy a certain set of criteria. While many schemes of 
information retrieval are being used, new techniques 
are always being considered and tested so that this very 
important task of computer technology can be perfected 


* Based on work done for the U. S. Army Electronic Proving 
Ground, Ft. Huachuca, Ariz. 

f Computer Applications Sec., Computer Dept., G.E. Co., 
Phoenix, Ariz. 


relative to any given set of circumstances. The impetus 
to develop more effective methods for recalling and 
correlating recorded information originated in the field 
of science and technology. It was evident more than 10 
years ago that considerable improvements in methods 
for using recorded knowledge were long overdue. The 
search still goes on — to study, to invent, and to perfect. 

The utility of information retrieval is self-evident. As 
more and more records and documents come into being, 
the problem multiplies in complexity. The need for 
swift recall is paramount. For example, in medicine, the 
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ready access to the expanding record of laboratory and 
clinical observations is of decisive importance both to re- 
search and to professional practice. The advantages of 
basing decisions in business and Government on a 
broad range of coordinated data are well recognized. In 
the case of library and great document centers, the 
value of a collection of documents and other records, 
from simply a practical point of view, is determined by 
the benefits achieved in using the information contained 
in this collection. 

One of the first entries into this vast field was the 
General Electric Psychological Matrix built for the 
armed forces during World War II. With this pseudo- 
computer, selections of qualified personnel could be 
made in accordance with a limited set of criteria. Since 
that time, progression to more complicated fields has 
been steady; the problem has been viewed more criti- 
cally and with much greater sophistication. Essentially, 
it has come to be realized that the principal process of 
information retrieval requires that the material charac- 
terizing the subject content be inspected as to whether 
it matches the criteria characterizing the information 
requirement. In other words, from the viewpoint of 
logical theory, material selected as pertinent constitutes 
a class whose defining characteristics are determined by 
the information required. This process is not a blind 
search but the basis from which further searches may 
be made. In fact, to satisfy an information requirement, 
a multiplicity of searches may have to be performed. 

Using this theory as a broad base, we may go on to 
specifics. The retrieval scheme to be discussed here en- 
compassed three types of data to be searched using 
three magnetic tapes which had previously been written 
on a card-to-tape converter. Each of the three types of 
information were comprised of a master record and from 
one to nine trailer records. When a particular master 
record was extracted, its associated trailer records were 
also extracted. 

In the first set of data, a master record was denoted 
by a zero in a particular card column. Trailer records 
were denoted by a number, one through nine, in the 
same column. In the second set, the denoting numbers 
were in a different card column. In the third set, no part 
of the record designated it as either master or trailer. 
Entries were limited to two records: first, a master; 
second, a trailer. A sense switch setting on the computer 
console was used to determine which tape to search. 

The manner in which to specify criteria on which to 
base the master record search was the first problem to 
solved. This problem was complicated by the fact that 
each of the three sets of data had up to 20 fields which 
might have to be searched. These fields could be of vary- 
ing length, and they could be continuous, interrupted, or 
overlapping. As an example, suppose digits 1 through 6 
were a date group. Field A could be digits 1 and 2 for 
yeaf; field B could be digits 3 and 4 for month; field C 
could be digits 5 and 6 for date of month ; field D could 
be digits 1, 2, 3, and 4 for year and month; and field E 


could be digits 1, 2, 3, 4, 5, and 6 for the entire date. 
To circumvent this problem, a decision was made to 
allow the criteria specification to designate the field also: 
fields would be defined and each field assigned a desig- 
nating letter from A through T. 

Let us now go to the query. Punched into a Hollerith 
card and read from the on-line card reader, this query 
should be of conjunctive normal form. That is, all cri- 
teria would be joined by the AND condition unless 
Otherwise specified. All criteria joined by EITHER-OR 
would be qt the end of a query. It would now be possible 
to have a query “A and B AND-EITHER C or D.” A 
symbol for NOT and a symbol for BETWEEN-AND 
were included in the logic to enlarge the capability of 
the query. Thus, the query could now be written: 

A and B AND NOT C AND BETWEEN D and E 
and EITHER F or G. 

The letters X, Y, and Z were used to denote the three 
logic symbols and would precede the symbol for the field 
they would affect. Individually, the meaning assigned 
to these letters was: X = LIMITS; F=NOT; and Z 
= EITHER-OR. 

Since the criterion specified could be of variable 
length, the one remaining item to be resolved before it 
would be possible to write a query was the establish- 
ment of a means whereby the computer could recog- 
nize the end of a criterion. Since it is possible for a cri- 
terion to contain a character which would be the same 
as a field or logic symbol, the presence of such a symbol 
alone would not suffice. Therefore, a decision was made 
to let a slash mark (/) denote the end of a criterion and a 
comma (,) separate the lower and upper bounds of the 
limits. 

In searching a tape, the first operation is reading the 
query from the decimal card and converting it to binary 
coded decimal form. This is the same form in which the 
information is written on magnetic tape. Then one rec- 
ord — the first master record to be interrogated — is 
read from the applicable magnetic tape. The program 
now begins a character-by-character analysis of the 
query. 

If the first character of a query is a logic symbol, one 
of three flags is set so that these flags may later be inter- 
rogated and the correct action taken when needed. If it 
is a field symbol, it is saved, so that later an appropri- 
ate subroutine may be selected to isolate the descriptor 
from the tape record in the particular field under inter- 
rogation. After the field symbol has been detected and 
saved, the program examines the “limits” flag. If this 
flag has been set, the computer assembles character 
after character from the query until a comma is de- 
tected. 

These assembled characters are then set up as the 
lower bound, limit A. Then, characters are assembled 
from the query until a slash mark is detected and set up 
as the upper bounds, limit B. If the limits flag has not 
been set, character after character is assembled from the 
query until a slash mark is detected. When this occurs, 
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the assembled characters are set up as the “criterion.” 
Once either the limits or criterion has been set up, the 
next character or symbol from the query is examined 
and saved. 

The first field symbol saved is examined and the ap- 
propriate digits from the tape record are isolated and 
set up as the “descriptor.” 

The program is now ready to begin the actual com- 
parison. Again it is necessary to interrogate the limits 
flag to determine 1) if an exact match is desired or 2) if 
the descriptor is to be between two specified limits, 
numerically or alphabetically. If the descriptor is to be 
between two limits, it is first compared with the lower 
bounds. If it is less than the lower bounds, the NOT or 
negated condition flag is inspected. If the descriptor is 
not within the established bounds and the field is 
negated, a matched condition arises and control is trans- 
ferred to the appropriate routine. A mismatched condi- 
tion arises if the field is not negated, and again control is 
transferred to an appropriate routine. 

The descriptor is compared with the upper bounds if 
it is equal to, or greater than, the lower bounds. If it is 
greater than the upper bounds it is not within the estab- 
lished limits. Therefore, the mismatched routine is used 
if the field is not negated; the matched routine is used 
if the field is negated. The descriptor is within the estab- 
lished limits if it is less than, or equal to, the upper 
bounds. Therefore, a matched routine is used if the 
field is not negated or a mismatched routine if the field 
is negated. 

If the limits flag was not set we look for an exact 
comparison between the descriptor and the field cri- 
terion. If the descriptor agrees with the criterion, we 
must interrogate the negated condition flag. If the field 
was not negated, we have a match; if it was negated, we 
have a mismatch. If the descriptor did not agree with 
the criterion, we must also test the negated condition 
flag. This time, if the field was negated we have a mis- 
match and if the field was not negated we have a 
match. 

After the comparison, control is transferred to either a 
matched routine or a mismatched routine whether the 
descriptor was to be between two limits or to agree ex- 
actly with the criterion. In the mismatched situation, 
the OR condition flag must be interrogated. If the OR 


condition is not in effect, at least one item has been 
found which does not satisfy the query. This informa- 
tion is sufficient to eliminate this particular record. 

Records are read from tape until another master rec- 
ord is found. Then all flags are reset to OFF and the 
query is processed as before. If the OR condition is set, 
it is possible that only one of a set of OR criteria has 
been mismatched. In view of this, it is now necessary to 
inspect the next symbol which has been saved. If this 
next symbol is a blank, the end of the query has been 
reached and the further possibility of a match falling 
under this OR is nil. It is then necessary to read tape 
records until another master record is encountered. 
Then the process is repeated. If this next symbol is a 
Z, or EITHER-OR symbol, again, there is no possibility 
of a match under the present OR; and this master rec- 
ord, together with its trailer, is skipped. If the next 
symbol is other than a blank or a Z, it indicates that 
the next criterion falls under this OR. Therefore, all of 
the flags except OR are reset and the next field is proc- 
essed as outlined. 

It is also necessary to examine the OR condition flag 
in the matched condition. If this flag is not set, one field 
criterion has been satisfied and the next field is ex- 
amined. If an OR condition already exists, one of the set 
of OR criteria has been satisfied and no other criteria 
need be examined. The remainder of this EITHER-OR 
set is skipped until another EITHER-OR set of criteria 
is reached or the end of the query occurs. If another 
EITHER-OR set is met, it is processed in the same 
manner. If the end of query is reached, the record has 
satisfied all conditions for this query, and it, plus all 
its trailer records, are printed out. Another master rec- 
ord is then read and processed. 

This method of inspecting the query and interrogat- 
ing master records repeats for record after record until 
the end of a file of information is encountered. At this 
point, all the records have been examined and all those 
that satisfy the query have been printed. The tape is 
rewound and the program is now ready to process an- 
other query. 

This retrieval system is atypical— its utility is mani- 
fold. As well as being a direct tool for retrieving past 
information and for solving immediate problems, the 
system is also used to plan for the future. 
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Communication Across Language Barriers* 

W. F. WHITMOREf 


I AM not sure whether my proposal is properly 
classed as a “blue sky” development; perhaps it’s 
only on the edge of the stratosphere. What I am 
seeking is some means of communicating a fairly limited 
set of ideas rapidly and unambiguously across language 
barriers. It should treat all languages on an equal foot- 
ing (if it does not evade the use of words entirely) , and 
it might be advantageous if it did not demand literacy. 
I want to indicate here why I think such a device would 
be useful, and some possible methods for determining 
its characteristics. I am not an electronics specialist, and 
I don’t propose to present circuit diagrams and the 
contents of black boxes. I have spent a good deal of my 
life writing operational requirements, and it is an opera- 
tional requirement which I am going to present now. 
Once convinced of the need, I think computer experts 
can supply the device. 

One of the most difficult aspects of the situation to 
understand is that such devices are not in heavy demand 
and are not already in existence. In some respects they 
do exist, of course: international flag signals for use by 
ships, the agreed code of highway warning signs in 
Europe, the Morse SOS and the voice call MAYDAY 
for distress, the reasonably universal symbols of mathe- 
matics and engineering. But, as far as computers go, the 
emphasis seems to be on translating literary or scien- 
tific texts, principally from Russian into English, with 
involved problems of shades of meaning and grammati- 
cal word order. This is admirable and difficult and chal- 
lenging, and remarkable results are being achieved. But 
the machines involved are far from portable, and they 
don’t work in real time. It should not take a high-speed 
digital computer to transmit a military order, control 
international air traffic at a busy airport, or order a meal 
in Southeast Asia. 

The ultimate operational requirement was expressed 
to me once by a Marine colonel in the following terms: 
“Look, Doc, I want a walkie-talkie and a set of coils. 
When I’ve got a Greek regiment on my flank, I put in 
the ‘Greek’ coil. Then I talk English and he hears 
Greek.” That expresses one form of the need admirably, 
and if that particular form of realization is possible, I 
should think the “blue sky” designation would be ap- 
propriate. However, something considerably short of 
the ideal would be most valuable. Another military ex- 
ample of the need : the London Economist reported that 
the first tactical order to the multilingual United Na- 
tions Force after arrival in Suez took four hours to 

* The views expressed herein are those of the author and do not 
necessarily reflect the official views of the U. S. Navy. 

t Special Projects Office, Bureau of Ordnance, USN, Washing- 
ton, D. C. 


transmit. This would be a rather intolerable situation 
in the heat of battle. 

With a stage set by these somewhat specific examples, 
let’s consider in broader terms the situation which 
causes the need for communication across language bar- 
riers. It starts from the fact that, in regard to foreign 
languages, Americans are illiterate and provincial. “If 
those foreigners want to talk to us, let ’em learn Eng- 
lish,” regardless of the fact that in structure and spelling 
it is one of the more difficult languages to learn. This 
attitude did no great harm when the United States was 
isolated from the rest of the world, and when other 
nations mainly wanted things from us. But with the 
present resources of communication and transportation 
we are far from isolated, and we want (and desperately 
need) support from other nations in the cold war with a 
rival superstate. Precisely at this time, these other states 
are beginning to be aware of their own national entities 
(largely as a result of U. S. ideals of national self- 
determination), and are developing a feeling that their 
languages are just as good as ours. In a very real sense, 
the psychology of a nation is expressed through its 
language. The diplomatic consequences of American in- 
ability to speak foreign languages are evident in the 
daily papers, and everyone is aware of the stereotype 
of the American tourist who makes it obvious that in- 
ability to speak English is a sign of basic stupidity. 

Another approach to this need is the universal 
language, such as Esperanto. This has been with us for 
decades without reaching any general acceptance, and 
I personally have strong doubts that it will do so even in 
the long run. The structure of proposed universal lan- 
guages is in the Indo-European family of tongues, and 
there are a multitude of languages of importance to us, 
but outside this framework. Pei 1 gives a glimpse of some 
of them. As Whorf 2 has pointed out, some American 
Indian languages, such as Hopi, have a structure quite 
alien to the Indo-European grammar and word order. 
For example, the Hopi view of an event always includes 
both space and time, so that his language functions 
adequately without tenses for verbs. In any event, the 
present needs are too imperative to wait upon the slow 
progress of any of the universal languages. This is not 
to deny that the problem would largely vanish if a uni- 
versal language were achieved. However, in the short 
term, we might hope for something nearer the situation 
of the written language in China. In all parts of China, 
the written ideographic language can be read, no matter 

1 M. Pei, “The World’s Chief Languages,” S. F. Vanni, New 
York, N. Y.; 1946. 

2 B, L. Whorf, “Language, Thought, and Reality,” J. B. Carroll, 
ed., Wiley-Technology. Press, New York, N. Y., p. viii; 1956. 
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how widely the local dialects of the spoken language 
may vary. The sign language of the American Indians 
is another example. 

I hope that you are convinced by now of the need for 
the proposed device. How should one set about getting 
it? Three or four years ago, when I first became inter- 
ested in this question, I talked to some electronics peo- 
ple. None of them saw any great problem in construct- 
ing such a machine, providing they had a reasonable set 
of requirements from the user. So the first step would 
seem to be to collect a set of prospective users, and try 
to find out what their real needs for communication are, 
with a view to minimizing these requirements, rather 
than particularizing subtle shades of meaning. In a 
tactical military application, for example, it should be 
possible to assemble (perhaps at one of the War Col- 
leges) a group of senior military officers with experience 
of field command, together with information theory 
specialists — maybe even advertising agents! This group 
would compile a vocabulary and grammar of the tac- 
tical orders and concepts which would need to be trans- 
mitted in situations involving the joint participation of 
U. S. and indigenous forces, with the requirement to 
keep these brief and unambiguous. Adequacy of the re- 
sulting lists might be tested with map exercises and war 
games involving foreign officers. The end result should 
be a body of tactical information of moderate size which 
is to be communicated across language barriers. A simi- 
lar procedure might be used in other areas where the 
problem arises, such as airways control, ship handling, 
highway traffic regulation, and so on. I think basic 
English comes under my strictures against the universal 
language solution, but the underlying attempt to boil 
down English to a minimal structure which is still ade- 
quate for rudimentary communication is certainly a 
good precedent for the information groups to follow. 

Once confronted with a manageable body of informa- 
tion to be processed, I feel sure the computer experts 
will be able to take it from there. The resulting equip- 
ment should certainly be portable and rugged enough 
to stand handling in the field. It would be nice to con- 
template something as compact and personal as the 
present transistor pocket radios. For instance, is it pos- 
sible to fit a device of this size with an indexed memory 
and some sort of transceiver so that a simple code trans- 
mission would cause the “radio” to repeat a particular 
selected command, e.g., “Advance 100 yards at sunset,” 
or “Mortar fire on Hill 209”? Or perhaps the memory 


device will be so bulky that it will be an adjunct to field 
telephone installations, receiving a coded signal and 
playing a message in the appropriate language. Note 
that this compression of stereotyped messages is already 
employed, for example, by Western Union in its canned 
greeting messages for birthdays or Christmas. 

The vocal communication may not be the best for all 
situations. In many cases, comrryinication by sketches, 
maps, or conventional signals (flag hoists, again) may 
be better adapted to the particular problem at hand. In 
aircraft control, one can operate directly on the pilot’s 
instruments. For example, in the integrated cockpit 
display system sponsored by the Office of Naval Re- 
search (ONR), one could indicate a desired destination 
directly on the navigational plot, or put maneuver sig- 
nals on the contact analog display. Indeed, the ability 
shown in the ONR project to get completely free of pre- 
conceptions about knobs and dials would be a necessary 
requirement for the designers of the communications 
equipment. The obsession with the spoken or written 
word may, in fact, be an unnecesssary handicap in ap- 
proaching the problem. Certainly a miniaturized form 
of the facsimile transmission used to send weather maps 
would be a contribution to the basic problem. 

It would be possible to develop at some length the 
advantages which might result from the type of device 
here proposed. Doubtless this audience can suggest 
many which have not occurred to me. But for the 
moment, it seems better to leave the problem fairly un- 
adorned and let discussion bring out further points. It 
should be remarked in closing that the proposition is a 
“blue sky” one in an unexpected and frustrating sense — 
you have been presented with a suggestion for construct- 
ing a machine without having the basic design require- 
ments formulated. I firmly believe that the essential first 
step is the conference of prospective users, and that the 
computer designers will have to wait in the wings until 
a specific body of information for communication has 
been proposed. So, in a sense, this paper is being given 
to the wrong audience. However, perhaps some of you 
have dual allegiance to requirements as well as to hard- 
ware; the rest of you probably have colleagues who 
should be prompted into the requirements type of ac- 
tivity. In any event, I hope there will be enough meat 
in this paper to persuade you to campaign for a set of 
requirements which will allow the computer experts to 
produce a useful and portable method for communi- 
cating across language barriers. 
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Symbolic Language Translation 

EUGENE C. GLUESINGf 


Introduction 

ALTHOUGH interest in the problems and develop- 

r~\ ment of techniques for machine translation of 
languages has been growing in the United States, 
no organized program comparable to that of the Rus- 
sians has been developed. In that country, teams of 
specialists have been working in this area for several 
years. Announcement of a “breakthrough” in the field 
comparable to that of Sputnik in space travel is not to 
be unexpected from them. 

The ideas expressed in the following report were first 
suggested by an article by Nagel. 1 Basically, they can be 
summarized, and at least partially justified, by the 
definition of language translation offered by Richens 
and Booth. 2 

“Taken in its most general sense, translation is the 
substitution of one language for another to express the 
same set of ideas. [The emphasis is mine.] It should 
proceed by a one-one substitution of symbols for each 
of the ideas expressed. ...” 

Symbolic Language Translation 

The vagaries, anomalies, and ambiguities of language 
format and meaning, as well as widely variant linguistic 
structures have confused many literary translators; 
also, they offer many obstacles and complications to the 
adaptation of computers to language translation. For 
most practical translations of one language into another, 
it is necessary and sufficient that the meaning of the 
texts be conveyed from the reference language to the 
object language. Structure and harmony and beauty of 
phraseology need not be preserved provided this mean- 
ing is conveyed. 

The intent of this paper is to indicate that through 
the use of an intermediate language, it is both possible 
and practical to apply electronic computers to the task 
of translating any existing language (or languages). 
The intermediate language proposed for use in transla- 
tion consists basically of that of symbolic logic, with 
perhaps an extended set of symbols designed to simulate 
specific grammatical groupings or terms such as parts 
of speech, phrases, and possibly some idiomatic ex- 
pressions, which are not traditionally represented in 
symbolic logic. The increase in the number of symbols 

t Remington Rand Univac, St. Paul 16, Minn. 

1 E. Nagel, “Symbolic logic, haddock’s eyes, and the walking dog 
ordinance,” in “The World of Mathematics,” Simon and Schuster, 
New York, N. Y., pp. 1878-1900; 1956. 

2 See [9], p. 25. 


needed may entail the increase in machine bit repre- 
sentation of characters from six to seven (excluding the 
parity bit). This would give an additional 64 characters, 
deemed sufficient for the purpose. Alternatively, and 
perhaps more reasonably, would be the employment of 
combinations of presently existing machine symbols to 
fulfill the requirements of the additional symbolic 
notations. 

In essence, the entire system may be compared in ap- 
plication to the pictogram type of symbolic characters 
of the Chinese language. The diagrammatic representa- 
tion of objects or ideas there makes it frequently possi- 
ble to conduct written discourse among some groups 
even though they cannot communicate orally. 

The Problem Defined 

The problem of language translation through sym- 
bolic conversion seems more and more to approach that 
of finding an appropriate symbolic vehicle, i.e., a set of 
rules and expressions which will adequately convey the 
meaning of a statement from one language to another 
and at the same time lend itself quite readily to trans- 
lation from language to the symbols and vice versa. 
Because of the complexity and variability of language 
structure, flexibility in the intermediate symbolic lan- 
guage is important. 

For example: 

1) @ ? [XR(XR'MW)] 

That is to say, “How does one convince himself 
that he knows what a word means?” 

2) (ALM)[T(P)(S^M')] = 1 

or, “In logic and mathematics the statement that 
the same statement always has the same meaning 
is largely true.” 

The above two statements, 3 translated into symbolic 
form, are illustrative of both the problems and the 
probabilities of the use of symbolic notation in language 
translation. This translation was made by using the 
generally accepted symbols of symbolic logic supple- 
mented by symbols supplied as needed by the writer. 
The use of any set of such symbols desired may be de- 
veloped by the individual translator if the interpreta- 
tion of the symbol is provided. 

To illustrate, let us take statement 2) above and give 
the step-by-step translation of language to symbols 
to language, using English and German. 

3 


See [7], p. 71. 
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English Symbols 

In all logic and mathematics ( ALM ) 

the truth of a statement (P) P(P) 

that a symbol implies a meaning (S-+M') 

is near to 1. =1 


Since the symbols other than the conventional logic 
symbols should have an accompanying explanation, 
they should be supplied with each statement. Thus: 

Z, = “int” logic (Logik) 

M—“ int” mathematics (Mathematik) 

A=“int” symbol (Symbol) 

M' — “int” meaning (Bedeutung) 

1 = “int” nearly one (nahezu Eins). 

The above statement designates how it is possible, 
given a constant meaning for a set of symbols, to trans- 
late from one language into another through a computer, 
given a set of bit configurations for each symbol and 
a dictionary of corresponding words and phrases, in any 
given language, for each symbol. The fact that the 
symbols can be interpreted in any language would make 
it possible, with limitations, to translate any language 
to any other through the symbols. For example, if the 
symbol, T(P), always refers to the truth of a proposi- 
tion or statement, going from the German, “die Wahr- 
heit einer Festellung,” to the symbol T(P), it can then 
be matched with the corresponding phase in French, 
Italian, Indian, Russian, English, or any other language. 

Two Approaches 

One of the more significant questions to be resolved 
in the development of this system is: To. what extent 
should the symbolism replace the language (• i.e ., words)? 
Regardless of how the problem is handled, a large 
amount of storage would be needed- — virtually enough 
to hold the vocabulary desired. Two relatively disparate 
approaches offer themselves: 

1) To develop a large number of symbols (in essence 
a new, symbolic language) to represent words 
and/or phrases, or 

2) To utilize a minimum number of symbols, enough 
to convey the meaning, with identifiers which are 
immediately translated within the construct of 
each statement. 

The statement already discussed, 2) would fall into 
the first category, provided a separate library of sym- 
bols and meanings were instituted. An example of the 
second type of treatment would be the proverb : 

( x)(x is in the hand) 

— >(x = y)(x is a bird)(y = 2x in the bush). 

It is believed that for all practical purposes, the sec- 
ond method would be the faster and more efficient, in- 


German 

Fur alle Logik und Mathematik 
die Wahrheit einer Festellung (P) 
dasz ein Symbol eine Bedeutung enthalt 
ist nahezu Eins. 


asmuch as less preparation, time, and effort would be 
required by the coder, and the alpha characters are al- 
ready established in the computer. 

It must be remembered that, regardless of which ap- 
proach is used, the original translation, from language 
to symbols, must be manually (humanly) produced. The 
primary advantages would be derived from the fact 
that: 

1) It is not necessary for the coder to know any 
foreign language in order to effect a translation. 

2) Translation into as many foreign languages as 
exist in the libraries of the computer is equally 
possible, once the symbolic coding is completed. 

A third approach to the translation problem would 
involve the possible combination of symbolic designa- 
tion with the procedures involving grammatical and 
structural analysis often described. No attempt has been 
made thus far to explore this possibility. 

Some Considerations 

To return to the first approach, namely that of de- 
veloping a complete symbolic language, a few additional 
explanatory remarks include: 

1) The necessity for establishing a large library of 
symbols would present considerable difficulty. 

2) Certain well-defined rules of operation and identi- 
fication procedures would require development. 
Examples of these: 

a) Verbs could all be expressed as relations, viz., 
R — seeing, testing, etc. The tense may be in- 
dicated by subscript notation, i.e., R\ = present, 
i? 2 = past, Rs = past progressive, etc. The pro- 
gressive or participial form is used exclusively. 

b) The subject will generally be designated (x). 

c) Separate tables of symbols for prepositions, 
conjunctions, certain adverbs and other parts 
of speech. 

d) Use of normal, logical symbolism wherever 
applicable. 

e) Use of some method, such as over-or-under- 
lining to designate various types of phrases 
and/or clauses. For example, a line under an 
element to indicate it is the object of a 
preposition. 

f) Introductory phrases, parenthetical phrases, 
and words and phrases which do not have a 
definite function in the sense or meaning of the 
statement may be omitted in the symbolic 
form, unless they enter into the meaning. 
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g) Where variant meanings of words exist, a sub- 
script to designate the intended meaning could 
be employed. Take, for example, the word 
“fast” : 

F\ = “int” rapid 

F 2 — “int” motionless 

Fz = “int” abstain from food 

^4 = “int” having zest for living. 

h) Active and passive voice or verbs could be in- 
dicated by an arrow pointing left or right above 
R, indicating the direction of the relation: 

R = “int” passive R 
R= “int” active R. 

i) It may be better in many instances to translate 
idiomatic and colloquial phrases into conven- 
tional phraseology in order to convey the 
meaning, and thence into symbolic form. 

j) Generally, terms would be units of structure in 
the statement, and symbols would indicate re- 
lationships between terms. 

k) Grouping of an element with its modifiers is 
recommended. 


It is anticipated that many other rules would develop 
as the system progresses. The ones listed above are 
merely suggestive; they are not to be construed as 
necessary or final. 

Let us consider some further examples: 

1 ) (Ax)M'(RP-R'W) J (AS)(TUF)++P 
x=“int” Symbolic Logic 
M' = “int” meaning 
i? = “int” attaching 
— = “int” “but not” 

^ = “int” because, since, as, inasmuch as, for. 

Here the symbol (T) is used for the group of subordi- 
nating conjunctions similar to “because.” The state- 
ment could be interpreted: “For all symbolic logic, the 
meaning has the relation of attaching ( R ) to a proposi- 
tion (P), but not the relation of attaching to a word, 
because for all statements the statement is either true 
or false if and only if it is a proposition. 

Illustrating the clarity of expression in a case where 
the English is not too clear, let us consider: 

“The graduating boy did not know whether he pre- 
ferred books and food, or money and clothes and pets, 

or a new automobile.” 


Symbolically: 

(Ex) (xG) [x — RR'(Ar\B) UC](~\[(DC\E) YF ]. 


A = “int” books 
B = “int” food 
C = “int” money 
D = “int” clothes 
E = “int” pets 


F=“int” new automobile 
G = “int” graduate 
P=“int” knowing 
R' = “int” preferring 
x— “int” a boy. 


Here the flexibility of the language is shown in that, 
if a running vocabulary is maintained for each state- 
ment, the corresponding terms can be translated to 
another language regardless of the symbols used. 

Again, in the case of poetry, where in many cases 
meanings are obscure and subject to the whim of the 
interpreter, the following selection from “Music Hall 
Song ” 4 may be illustrative: 

(Ex)(IR 2 x) U(x~R')~* 

(YR"?)(IR"'x). — ! 

x = “int” donkey R' — “int” going 

i ?2 = “int” having (past) P" = “int” thinking 

F=“int” you P'" = “int” walloping. 

— = “int” no 

Original: 

“If I had a donkey wot wouldn’t go 

D’ye think I’d wallop him? No, no, no!” 

Two things are suggested here: 

First, it may be a good idea to indicate verb mood 
with some symbol. 

Second, it may be better to use numeric superscripts 
rather than prime designations to avoid awkwardness 
in sentences with many verbs. 

In many cases, any attempt at literal translation 
would yield only failure. Such is the case in expressions 
involving puns, colloquialisms, modern jive talk, and 
idioms. For example, “Mud thrown is ground lost,” 
would require rewriting to explain the meaning. The ex- 
pression, “The team caught fire and won the game in 
the dying moments,” would require some interpretation 
and revision, as would, “Dig that cra-a-a-zy cat!” 

In summary, the effective translation of the meaning 
of language by machine involves: 

1) The use of symbolic logic symbols as an inter- 
mediate language for transmitting the meanings 
of sentences. 

2) An additional set of symbols to designate word 
groups, phrases, idioms, and other expressions not 
effected by the existing symbols of logic. 

3) Either an extension of the bit structure of charac- 
ters by one bit to include the additional symbols, 
or the combination of presently-used characters 
to form new symbols. 

4) Dictionaries of corresponding terms in both the 
reference and target languages. These could be 
limited to the specific vocabularies required for the 
topic areas, such as science, mathematics, and 
electronics. 

5) Trained and qualified individuals to translate the 
reference text into the intermediate symbols and 
to translate this symbolic language into the target 
language. 

4 J. Beuler, in “The Oxford Dictionary of Quotations,” Oxford 
University Press, Oxford, Eng., 2nd ed., p. 41; 1955. 
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Conclusion 

It is not expected that the method outlined here will 
preserve the aesthetic values of poetry, the local color of 
colloquialisms and idioms, nor the fluidity of oratory. In 
many cases, it may even be difficult to preserve the 
meaning, as in the not-so-new example of the machine 
translation of the Biblical quotation, “The spirit is will- 
ing, but the flesh is weak,” which came out in one Rus- 
sian translation, “The whiskey is good, but the meat is 
rotten.” A method which can be put into operation with- 
out too much time and difficulty is all that is intended. 

The ideas presented here are neither complete nor 
fully developed. They are fragmentary and intended to 
stimulate further thinking and development. Entry into 
a new area of application is usually slow, incomplete, 
and often muddled and confused by human stupidity. 
Such attempts, however, are necessary even to the slow 
progress of fundamental understanding. Beginnings 
must be made somewhere, sometime. Perhaps this re- 
port will establish one such beginning. 
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A Generalized Scanner for Pattern- and 
Character-Recognition Studies 

W. H. HIGHLEYMAN f and L. A. KAMENTSKY f 


Introduction 

I N the past 30 years, various ideas have been pre- 
sented for machines which could recognize spatial 
patterns. With the advent of the digital computer 
and its use in data processing, there has been a great 
increase of interest in the automatic conversion of hu- 
man language to language understandable by a machine. 
The translation to machine language of spatial sym- 
bols — pattern recognition — is important to this con- 
version. 

A subclass of pattern recognition is character recog- 
nition, i.e., the translation of a given set of hand- 
written or printed symbols to machine codes represent- 
ing each symbol or a group of symbols. Many of the 
methods proposed for character recognition have not 
been evaluated critically. It is the purpose of the gen- 
eralized scanner described in this paper to facilitate a 
systematic study of character-recognition techniques 
and an evaluation of methods prior to actual machine 
development. 

f Bell Telephone Labs., Inc., Murray Hill, N. J. 


The efficient study and evaluation of character-recog- 
nition methods requires the aid of a machine because of 
the quantity of data reduction involved. Such a ma- 
chine might take one of two forms. For each recognition 
method a special-purpose machine may be constructed, 
tested, developed, modified, and perhaps reconstructed, 
until it either works or has proved unfeasible. The sec- 
ond type of machine is a general-purpose machine that 
is capable of simulating any recognition method, such 
that new methods and modifications of methods re- 
quire only the generation of programs on paper. 

One of the foremost problems in the development of 
a general-purpose character-recognition machine is that 
some methods require rather complex scans. These 
scans reduce the input information to a more efficient 
form. This problem of scan complexity might be solved 
by building a relatively simple scanner, such as a high- 
resolution raster scanner, to convert the spatial pattern 
to electrical signals. A general-purpose digital computer 
could then be used to translate this simple scan to the 
more complex scan. This same computer would then 
simulate the recognition logic of the particular system 
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under investigation, producing a recognition code cor- 
responding to the simulated scan information. 

Such a simulation method was found to be more ex- 
pensive in terms of cost of equipment and required 
digital computer time than the machine which is the 
subject of this paper. This machine consists of a pro- 
grammable scanner for simulating the desired scan, and 
a digital computer for simulating the recognition logic. 
In this case, no computer time is required for scan 
simulation. 

This paper describes a programmable, or generalized, 
scanner that has been developed at the Bell Telephone 
Laboratories. In conjunction with the IBM 704 Elec- 
tronic Data Processing Machine, the scanner is intended 
for use as a laboratory tool in the investigation of pat- 
tern- and character-recognition methods. 

Description of the Generalized Scanner System 

The design of the generalized scanner system was 
motivated by several objectives including 

1) Suitability for large data volume studies; 

2) Suitability for studying different methods with 
reasonable scanner and computer times; 

3) Suitability for scanning opaque forms containing 
handwriting as well as larger patterns; 

4) Output in a form compatible with and efficient for 
the IBM 704 computer. 

The generalized scanner is basically a flying-spot 
scanner with a scan configuration that can be pro- 
grammed. An opaque pattern to be studied is illumi- 
nated by the cathode-ray tube light source, and the re- 
flected light is detected and quantized into two intensity 
levels. 

An output on magnetic tape is provided in a format 
compatible with the IBM 704, indicating the reflec- 
tivity and associated scan position of that part of the 
pattern being scanned. 

Method of Scanning 

The document to be studied is placed on a document 
carriage, and the desired pattern is positioned in the 
scanning area of the cathode-ray tube. The pattern is 
illuminated by the light spot from the cathode-ray tube. 
The spot moves in constant time (200 jusec) from any 
given point on the pattern to any other point. The scan 
is established by specifying a sequence of coordinates of 
the light spot as a function of time (Fig. 1). Any one of 
10,000 points in a 100 by 100 array can be specified as 
the end point of a scan segment. For each scan segment, 
the coordinates of such a point are specified; the beam 
moves from its present position to the specified point, 
following a straight line with uniform velocity. Here it 
will remain until the next point is specified. In this man- 
ner any scan can be approximated by a series of straight 
line segments. The 10,000-element resolution is sufficient 
to generate complicated scans, or give good resolution 
for rectilinear scans. 



Fig. 1 — Method of scanning. 


cathode - 
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Fig. 2 — Block diagram of generalized scanner. 


Generation of the Scan 

Fig. 2 illustrates the major functional blocks of the 
generalized scanner system. The information necessary 
to control the scan pattern and the output format is con- 
tained on a continuous loop of magnetic tape. This pro- 
gram tape is produced by the IBM 704 by means of 
program-generation routines for each type of scan pat- 
tern. The generated program tape contains six bits plus 
a parity bit per line — the IBM 704 bit arrangement on 
tape. 

Three lines of tape are assembled in the input register 
from the program tape to form an 18-bit instruction 
word. These 18 bits in the tape format shown in Fig. 3 
are interpreted as one of five operations. These opera- 
tions fall into two groups: scan operations and format 
operations. The scan operations specify a scan to a 
point ( X , Y) determined by 14 bits of the total instruc- 
tion. Two types of scan operations are available: 1) a 
scan with the beam unblanked (a “1” in the 5 position), 
and 2) a scan with the beam blanked (a “0” in the S 
position). The three format operations are start-of-pro- 
gram, write end-of-record (EOR), and write end-of-file 
(EOF). The start-of-program operation is represented 
by a unique 18-bit code that identifies the beginning of 
the program on the program tape loop. The end-of- 
record is used for the proper organization of output data 
for later use with the IBM 704; the end-of-file indicates 
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the end of the program (the scanning of one pattern has 
been completed). The end-of-records and end-of-files 
written on the output tape have the same tape repre- 
sentations as are used in the IBM 704. With an Ampex 
FL-100 tape machine, 60,000 instructions can be con- 
tained on a single tape loop, and instructions can be exe- 
cuted at a rate of 4000 per second. 

The instruction contained in the input register is 
interpreted by the control unit. If it is a scan instruc- 
tion, the 14-bit encoding of the coordinates is trans- 
ferred to the decoder. A pair of 200-jusec ramp currents 
are generated by the decoder (one for the X coordinate, 
one for the Y coordinate) and are used to drive the de- 
flection yokes of the cathode-ray tube. The initial cur- 
rent of each ramp is proportional to the value of the 
originating coordinate ; the final current is proportional 
to the value of the terminating coordinate of the scan 
segment. The set of ramp currents causes a scan to be 
produced in an image area that can be any size from 
1/8 by 1/8 inch to 3 by 3 inches. 

Mark Detection 

Two opaque areas are actually scanned by the cath- 
ode-ray tube light source (Fig. 4). One contains the 
document under investigation; the other, a blank piece 
of paper. The reflections from the two areas are each 
focused on two photomultipliers, each sensitive to a dif- 
ferent color. Hence, the scanner has a 2-color capability. 
The outputs of the photomultipliers for each color are 
differenced. This compensates for changes in CRT in- 
tensity, and for multiplier sensitivity as a function of 
beam position. 

A DuMont Dual Opaque Flying-Spot Scanner has 
been modified for the scanning and pickup functions. 

Recording of the Scan Information 

Before scanning a particular pattern, a code identify- 
ing that pattern is set into the toggle switch pattern 
identification register (PI DR, Fig. 2) by the machine 
operator. This code is the first piece of information writ- 
ten on the output tape (an Ampex FR 400) during the 
scanning of that pattern and serves to identify it in 
later processing by the IBM 704 computer. The scan 
information following the identification code includes 
the photomultiplier output information and the end- 
point coordinates for each scan segment using the tape 
format shown in Fig. 3. 

A word length of 18 bits, or three tape lines, is written 
on the output tape for each scan segment. For a given 
scan segment, the output word contains the 14-scan 
address bits ( X , F) which were specified by the instruc- 
tion word. This serves to identify the scan information 
with the corresponding scan segment. The outputs of 
the two photochannels are indicated by two other bits 
(PI, P2), a “1” being written if a mark of that particular 
color were crossed by the scan segment. A “1” in an- 
other bit ( E ) indicates an input parity error, and the 
last bit (S) identifies the type of scan (a reproduction of 
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Fig. 4 — Pickup system. 


the corresponding bit in the instruction word). This in- 
formation is all that is required to specify the results of 
the scanner operation to the computer. 

The output tape may also be written in another form. 
In many cases, the photomultiplier outputs alone are 
sufficient for processing; hence, the third line only is re- 
quired on the output tape. Although only two of these 
six bits are useful, writing only the third line is a more 
efficient input to the IBM 704 since the elimination of 
the other two lines will triple the input data rate to the 
IBM 704. (In many recognition procedures, the IBM 
704 is input limited.) Therefore, the option exists in the 
scanner to write only the third line of the output word 
on the output tape. The output tape travels at one- 
third normal speed, thus giving the normal line density. 

IBM 704 Compatibility 

The program tape for the scanner is written by the 
IBM 704 computer on a reel of tape. This reel is then 
placed on the output tape machine, and the program 
information is transferred directly to the continuous 
loop tape machine under the control of the logic unit. 
Later, as the output tape is written, its timing is con- 
trolled by the program tape. That is, when a line enters 
the logic unit from the program tape, a line is also writ- 
ten on the output tape. Therefore, even though the 
speeds of the two scanner tape machines may differ from 
each other and from the IBM 704 tape machines, the 
line spacing on the output tape is, for all practical pur- 
poses, identical to that of the original program tape 
prepared by the IBM 704. By the same process, end-of- 
record and end-of-file gaps written on the input tape 
will appear as the same size gaps on the output tape. 
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Hence, the output tape is compatible with the IBM 704 
computer. 

Program Example 

An example of a simple program may clarify the op- 
eration of the generalized scanner. Fig. 5 illustrates the 
scanner’s use in the study of recognizing numbers writ- 
ten with the double-dot system of constrained writing. 1 
(A number written using two dots as a guide can be rec- 
ognized by noting the crossings of the number with 
radial lines emanating from the dots.) 

Assume that it is desired to program a set of radial 
lines in the form of a cross, and to move this cross 
through the field systematically. The two particular 
crosses that are centered on the constraining dots are 
to be used for the recognition procedure. One of these 
crosses is shown in Fig. 5(a). The first instruction (Xi, 
Fi, B) positions the beam. ( B stands for Blank; 5 1 
stands for Scan.) The next instruction causes the beam 
to trace out one radial line, and so forth. 

Let it be assumed that this particular scan pattern is 
positioned over a blue character written about red dots 
as shown in Fig. 5(b). Then the output tape will appear 
as shown (the first photochannel bit is the blue channel). 
Corresponding to the first instruction, X\, Y\, 0, 0, B is 
written on the output tape (no outputs observed since 
this is a blanked scan). The next instruction will create 
a scan segment that crosses a blue mark, and this is so 
indicated in the output word. The next scan segment 
also crosses a blue line, and so on. 

In interpreting these data, the computer will examine 
all of the crosses for a crossing of a red mark by the scan 
segment from X 6 , F 6 , to X 7, Y-j (the middle of the cross 
as shown in the figure) since this means that the cross is 
aligned with the dot. It will then use the blue crossings 
of that particular cross for the recognition procedure. 
Hence, in this case, the programmable recognition 
machine (scanner and computer) has been able to 
search for a character, to find it, and to identify it. 

Conclusion 

The generalized scanner is shown in Fig. 6. From left 
to right the units are: the power supply cabinet, the pro- 
gram tape machine, the output tape machine, the con- 
sole, and the flying-spot scanner light source and pickup 
units. 

The generalized scanner coupled with the IBM 704 
computer is a powerful laboratory tool which is in- 
tended to greatly facilitate the investigation and evalua- 
tion of various character- and pattern-recognition meth- 
ods. The most important characteristics of the scanner 
in review are 

1) 100-line resolution (10,000 resolvable points); 

2) A scan field variable from 1/8 by 1/8 inch to 3 by 
3 inches; 

1 T. L. Dimond, “Devices for reading handwritten characters.” 
Proc. EJCC, pp. 232-237; 1957. 
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3) Two-color capability; 

4) 60,000-instruction capacity; 

5) Execution rate of 4000 instructions per second ; 

6) IBM 704 compatibility. 

As experience is gained with the generalized scanner, 
it is planned to add a more flexible instruction code to 
allow the scanner to perform some of the functions now 
performed by the IBM 704. This will reduce or eliminate 
computer time for searching and for some data reduc- 
tion operations. Additional instructions will allow scan- 
ner searching for characters and conditional transfers 
between scans. More information will be made available 
from the scanning process, and more efficient methods 
for writing on the output tape will be included. It must 
be stressed that this additional flexibility would serve 
only to reduce the amount of computer time required 
for simulation. The present generalized scanner coupled 
with a digital computer is sufficiently flexible to investi- 
gate any desired character-recognition method. 



1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


295 


File Searching Using Variable Length Keys 

RENE DE LA BRIANDAISf 


M ANY computer applications require the storage 
of large amounts of information within the 
computer’s memory where it will be readily 
available for reference and updating. Quite commonly, 
more storage space is required than is available in the 
computer’s high-speed working memory. It is, therefore, 
a common practice to equip computers with magnetic 
tapes, disks, or drums, or a combination of these to 
provide additional storage. This additional storage is 
always slower in operation than the computer’s working 
memory and therefore care must be taken when using it 
to avoid excessive operating time. 

This paper discusses techniques for use in locating 
records stored within a low-speed memory medium 
where they are identifiable by a key word or words of 
variable length on a machine not equipped to accomplish 
this automatically. The technique is also applicable to the 
conversion of variable word length information into 
fixed length code words. 

When records can be stored in a slower memory medi- 
um in such a fashion that their exact location may be 
determined from the nature of their designation, rea- 
sonably efficient handling procedures can be established. 
However, as is often the case, the records cannot be so 
easily located and it becomes necessary to examine each 
entry in order to locate a particular record. Sequential 
examination of the key words of each record, until the 
desired record is located, is not a satisfactory approach 
on machines not having automatic buffered searching 
facilities, and may not be satisfactory on machines so 
equipped, if, for instance, reels are searched which need 
not be because it is not known in advance that they are 
not needed. Because the average search time for the 
desired records is proportional to the number of records 
stored in this slower memory, the total operating time 
of a program is proportional to the product of the num- 
ber of records stored and the number of records for which 
search is instigated. This product may approach the 
square of the number of records involved. This relation- 
ship between operating time and the number of records 
stored places a definite limitation on the number of 
records which may reasonably be stored by any par- 
ticular program. Fortunately, if the records can be 
stored with the key words in some ordered arrangement, 
an educated guess can then be made as to the location 
of a particular record, and a better system will result. 
However, records cannot always be arranged in such a 
fashion. 

When records are large compared to the key word or 
words, a useful technique is to form an index having in 

f U. S. Naval Ordnance Lab., Corona, Calif. 


it just the key words and the location on the correspond- 
ing record. A particular record is then located by search- 
ing the index to determine the record’s location and 
then taking the most rapid approach in arriving at the 
record. Since only the key words and the locations of the 
corresponding records are stored in the index this tech- 
nique reduces the amount of information which must be 
handled during a search. With the smaller amount of in- 
formation involved it is often possible to utilize the com- 
puter’s high-speed memory for the storage and search- 
ing of the index. Furthermore, this index can now be 
ordered or otherwise subjected to speed-up techniques. 
This index approach often can greatly improve the op- 
erating efficiency of record handling programs. In many 
instances this improvement is sufficient but there are 
also many cases where a further increase in efficiency 
is necessary. In particular, the time required to per- 
form the search when consulting the index may still be 
objectionably large. If this is true then it is necessary to 
apply a speed-up technique to the searching operation. 
Of course these techniques can be applied to any table 
lookup problem where the nature of the key word or 
words does not lead directly to the desired entry. 

Peterson 1 has suggested a method of arranging such 
an index which greatly reduces the lookup time when 
it can be applied. This method, referred to as the 
“bucket method,” calls for randomizing the digits of the 
key word to produce a number which indicates that 
point in the available memory where a particular key 
and its corresponding record location should be stored. 
If this particular space is not available it is stored in 
the next highest (or lowest) available space. When seek- 
ing a particular record, the exact randomization process 
is repeated producing the same indicated point and a 
search is begun from that point in memory and in the 
previously used direction. When using this method, 
facility must be provided to continue from the other end 
when one limit of the available space is reached. During 
the process of placing an entry in this table a record is 
kept of the number of steps which must be taken before 
finding space to store the entry. This number is then 
compared with and, if necessary, replaces the previously 
occurring maximum. This maximum can then be used to 
limit the operation when a search is undertaken for an 
item for which there is no entry. 

The object of this procedure is to distribute the rec- 
ords evenly throughout the available space in spite of 
uneven characteristics which occur because of similar- 
ities in the structure of the keys. A limitation of this 

1 W. W. Peterson, “Addressing for random-access storage,” IBM 
J. Res. Dev., vol. 1, pp. 130-146; April, 1957. 
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method is that it is necessary to store the key as a part 
of the entry in order to identify an item positively 
during the searching phase. This increases the storage 
space used and, as the memory fills up, the average 
number of spaces which must be examined before an 
entry is located increases. This saturation effect greatly 
decreases the operating efficiency. Furthermore, this 
method becomes far too involved from the bookkeeping 
standpoint when the keys are variable word-length 
words which exceed the length of one computer word 
when working with a fixed word-length machine. 

A problem involving variable word-length information 
confronted us in writing a FORTRAN type compiler 
for the Datatron 205. In this version of FORTRAN we 
allow the names of quantities to be of any length and 
they may consist of any number of separate words pro- 
vided there are no intervening special characters. For 
reasons of simplicity in the internal handling, each of 
these external names must be converted to a code word 
of specific length such that it may be stored with other 
information within one cell. Further, it is necessary that 
the external name be preserved and made easily avail- 
able for annotating the finished program. To accomplish 
this, each external word is placed in a file which, for 
future reference, is written on tape as it is formed. The 
position of each entry in this file is then used as the 
code word for the name. Knowing the position of a 
particular external word makes it very simple to recover 
for annotation purposes. 

When it is necessary to have some method of deter- 
mining whether any given word has occurred previously, 
sequentially scanning the previous entries is impractical 
because of the limitations mentioned previously. A more 
desirable situation would be a scheme that in no way 
depended upon the amount of previously stored in- 
formation in the file. If this could be accomplished, the 
operating time would then be more nearly proportional 
to the number of items for which a search was performed 
rather than the product of this number and the total 
number of entries in the file. 

The technique we have developed accomplishes the 
goal. The operating time is related to each letter of the 
external word and is, therefore, proportional to the 
number of letters in each word for which a search is per- 
formed. The size of the file has little effect on the op- 
erating time. Total operating time is proportional to the 
number of external words multiplied by the time re- 
quired for a word of average length. 

In our particular application each external name 
becomes a record which is stored on tape. The location 
of the first word of each record is the code for that ex- 
ternal word. The external word itself is the key. In the 
computer’s main (drum) memory we form an index of 
the key and its corresponding code. The organization 
of this index is the reason for this method’s efficiency. 
We call this the “letter tables” method. The index con- 
sists of a set of tables with the number of tables as well 
as the number of entries in each table varying with 


each running of the program. An address, usually the 
lowest numbered available cell, is assigned as the start- 
ing point of the table of first letters. The key words are 
examined letter by letter and each first letter which 
occurs is entered in the table of first letters if such an 


1st Letter Table * 

2nd Letter Tables 
■3rd Letter Tables 
4th Letter Tables 
5th Letter Tables 
6th Letter Tables 



7th Letter Tables _ 

All entries of any one table are covered by a single arc ( — ). 
Fig. 1 — Formation of a set of tables. 


entry does not already exist (Fig. 1). A new table is 
assigned to each of these letters. It is formed by assign- 
ing it a starting address. Each second letter which occurs 
is entered in the table assigned to the letter which it 
follows. To each of these second letter entries a new table 
is assigned as before. Each third letter is placed in the 
table assigned to the letter pair which it follows. Thus, 
there will be a table for the letters which follow “CA” 
and a different table for those letters which follow “CB” 
if these combinations occur. To each third letter a table 
is assigned and the technique is repeated until all letters 
of the key have been taken care of. 

In our program the words which make up a key are 
compressed to eliminate blank spaces before being 
placed in the table as one word. A blank space is then 
used to signify the end of the word. This blank space is 
stored in the set of tables as a signal that the entry is 
complete and the code word which was previously de- 
termined is stored with this blank instead of an indica- 
tion of a table assigned for the next letter. If preserva- 
tion of the blanks is necessary, a special unique mark 
may be used to signal the end of a key. 

When attempting to determine the previous occur- 
rence of a particular word the procedure is first to scan 
the table of first letters until the desired letter is found. 
The second letter is then compared with the various 
entries in the assigned table until agreement is found for 
the second letter. The third letter is compared in a 
similar manner with the indicated table and the process 
is continued until comparison occurs with a blank. When 
this occurs the desired code may be found in the re- 
mainder of the entry with that blank. In the event that 
the desired letter does not occur in a particular list, it is 
known that this word is not in the index. If this word is 
to be added, it is now necessary to store the remaining 
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letters of the word in the index using the previously de- 
scribed technique. The first letter to be added will be 
the one which did not occur. Once a letter has been 
added to the tables there are no entries in the newly 
formed table so no further searching is necessary, and it 
is only necessary to add each letter remaining in the 
word to the new tables. 

As previously mentioned, the code is stored with the 
blank which signifies the end of the word. This code is 
the next available location for a record in the external 
language file. As soon as the index is complete the ex- 
ternal word, which is this next record, is placed in the 
file. The location indicator is adjusted to indicate the 
next available space in this file and this determines 
what the next code word will be. 

Since the amount of space required for any of the 
tables in this type of operation depends upon the man- 
ner in which the letters happen to follow each other, it 
becomes necessary to assign space to each of the tables 
as it is needed. This is best done by assigning the next 
available space to whichever table is being expanded. 
The programming principles involved in this type of 
operation were first described by Newell and Shaw. 2 
However, since in our application it is not necessary to 
remove entries from the tables as was the case in their 
application, a less involved method than the one they 
described can be applied. If a continuous portion of 
memory can be devoted to the storage of the tables it 
can be utilized in a sequential fashion with the next 
available space being the next cell. A simple counter 
can then be used to keep track of this next available 
space. This operation results in the storage of the various 
tables in an overlapping fashion and, therefore, it is 
necessary that each entry in a table have an indication 
of the location of the next entry in that table. 
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4-digit address 
of assigned table 

4-digit address 
of next entry 

(a) 
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0 0 

CODE 

4- digit address 
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Fig. 2 — (a) Format of a letter entry, (b) Format of a 
word-terminating entry. 


Fig. 2 shows the two types of entries in these tables. 
The letter entry shown in Fig. 2(a) has the letter in the 
two digits on the left end of the word, and the four 
digits on the right end are used for the address of the 
next entry in this table. The remaining four digits spec- 
ify the address of the first word in the assigned table. 


2 A. Newell and J. C. Shaw, “Programming the logic theory ma- 
chine,” Proc. WJCC, pp. 230-240; February, 1957. 


Fig. 2(b) shows the configuration used for the storage 
of a blank which signifies the end of a word. In this 
word, the first two digits are blank, the next four digits 
are the code, and as in the previous case, the last four 
digits indicate the address of the next entry. The end 
of a particular list is signified by the absence of an 
address indicating the next entry. 
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Fig. 3 — Memory distribution of a set of tables. 


Fig. 3 shows a memory distribution which would re- 
sult from the storage of the four words: can, candy, 
count, and could. In this example there are a total of 
twelve separate tables stored in the fifteen spaces. Al- 
though the system may appear more complicated, no 
more bookkeeping is required than in a system of se- 
quentially searching a list of entries where the different 
entries require different numbers of words in memory 
for their storage. 

An additional time-saving feature that can be applied 
with a slight additional cost of memory space is the 
establishment of a full set of possible first letters with 
the corresponding second letter tables assigned starting 
points. By this we mean that if the first letter is “E” the 
first entry in the assigned second letter table will be in 
the fifth cell with respect to the beginning of the tables. 
Those first letters which do not occur are then wasting 
one memory word each. 

Since in this scheme there is only one letter stored in 
each word, it requires far more space than other schemes 
where several letters are stored in one word. However, 
the advantage in scanning speed makes up for this dis- 
advantage and it becomes practical to form several 
tables of this type, storing them in a slower memory 
medium until needed. When doing this, difficulty can 
arise if care is not taken to avoid excessive transfers to 
and from this slower memory. Methods of overcoming 
this problem depend upon the particular application. 
In our application each set of tables is no longer needed 
after one complete pass of the input, and when overflow 
of space occurs, during input, those words which cannot 
be converted are marked. After completion of the first 
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input pass the set of letter tables is erased and a second 
input pass begins at that point in the input data where 
overflow occurred. A new set of tables is formed to con- 
vert the marked words and the process can be repeated 
as often as necessary. 

Now let us look more closely at the technique in an 
effort to determine the operating time. For the sake of 
the following discussion, we shall assume each character 
to be one of 40 possibilities. This list of possibilities could 
include the 26 letters of the alphabet, ten decimal digits, 
a blank, and three special characters. The maximum 
number of comparisons necessary to determine a word 
then comes to 40 for each character in the word includ- 
ing the blank which terminates the word. We find how- 
ever that this maximum is seldom reached. To show 
this, let us assume a file contains 1000 words. If the first 
characters of these words are evenly distributed among 
the 40 possibilities there will be approximately 25 words 
starting with each character. Since 25 words can pro- 
vide only five-eighths of the possible entries in the sec- 
ond letter tables we can expect that to determine a 
word, the average number of comparisons needed will 
be 20 to determine the first letter, 13 to determine the 
second letter, and thereafter only one per letter. Thus 
a nine-letter word including the blank might require an 
average of approximately 40 comparisons. 

Now we increase the number of words to 10,000 and 
we find that we have an average of 250 words per char- 
acter in the first letter table, 62 \ per character in the 
second letter tables, and approximately one and one- 
half in the third letter tables. The average number of 
comparisons for a nine-letter word now comes to 20 for 
each of the first three letters and one for each of the re- 
maining letters. This new total of 66 is 1.65 times greater 
than the previous average. 

When the words stored in such a fashion are taken 
from some formal language such as English, the num- 
ber of words beginning with certain characters tends to 
increase and, therefore, the possibility of having all of 
the various characters occur in the table of first letters 
is decreased. This decreases the average number of 
comparisons needed to determine the first letter. Fur- 
thermore, the number of letters which normally might 
follow a particular letter is limited so that the average 
number of comparisons is reduced for subsequent letters 
also. 

For example we normally expect “U” to follow “Q” 
and one of the vowels or the letters “H,” “L,” “R,” or 
“Y” to follow the letter “C.” Thus we find we are able 
to adjust favorably the averages we determined previ- 
ously due to bunching, a phenomenon which usually 
leads to decreased efficiency in other methods. In the in- 
stance of the 10,000-word file we might expect the aver- 
ages to be more like 16, 12, 8, 3, and one thereafter 
which would be 44 for the nine-letter word, an improve- 
ment of one-third. 


We shall now attempt to compare this technique with 
Peterson’s “bucket method.” The bunching which we 
described as useful to us must be overcome when using 
the “bucket method.” This is usually done by generat- 
ing a number which is influenced by all characters of the 
word and yet appears to be random with respect to 
them. This is extremely difficult when dealing with vari- 
able word length information, especially with long words 
where only one letter differs or where the letters are the 
same but two have been interchanged. Other difficulties 
encountered with the “bucket method” include termi- 
nating the search when enough entries have been ex- 
amined to know that the word is not in the file and then 
finding suitable space to insert the word. The resultant 
bookkeeping can actually consume many times more 
operating time than the actual comparison operation 
requires. Thus, although fewer comparisons may be re- 
quired when using the “bucket method,” due to the vari- 
able word length problems, the operating time is 
pushed up into the same range as that of the “letter 
tables method” which we have described. 

Another way in which the two methods must be com- 
pared is with regard to the amount of memory required 
for the storage of similar amounts of information. In a 
fixed word length machine up to all but one character 
might be wasted with each word stored when using the 
“bucket method.” Also, when a minimum of two adja- 
cent cells are used, one for the word and one for the code, 
an occasional word will be lost due to storage of a word 
requiring an odd number of cells in such a position as to 
leave only one unused cell between itself and an adja- 
cent entry. The amount of memory space required for 
the storage of a particular amount of information in the 
“letter tables method” cannot be specifically determined 
because it is quite dependent upon the number of repeti- 
tions of letter sequences which occur. The number of 
cells required will always be greater than the number of 
words and may in some instances approach the total 
number of characters stored. This means that the “letter 
tables method” will probably require from two to six 
times as much memory space as the “bucket method” 
for a similar amount of information. 

The amount of code required by the “bucket method” 
may run three to five times as much as for the “letter 
tables method” depending on the application. Also, this 
latter method could quite probably be written as a sub- 
routine or by a generator in a compiling routine with 
much greater ease than could the “bucket method.” 
The final choice of method depends on details of the 
specific problem and also on operating characteristics 
of the machine on which it is to be run. It may in some 
cases be necessary to program and run tests before a 
final determination can be made. Both methods are an 
order of magnitude faster than the simple sequential 
search and we have found them both to be of value in 
different parts of the FORTRAN for Datatron Project. 
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Program Design to Achieve Maximum Utilization 
in a Real-Time Computing System 

A. FREDERICK ROSENEf 


I N THE DESIGN of a real-time computing system, 
a major problem is the selection of machine capacity 
and speed adequate to handle the maximum data- 
processing rates and yet not burden the system at 
other times with unused machine capability. All consid- 
erations — cost, efficiency, size, good design practice — 
require maximum utilization of the computer’s com- 
plete capability. The design of the computer program to 
achieve maximum utilization is especially difficult in 
real-time systems which must be capable of handling a 
wide variation in data rate. 

A real-time program design problem of this type was 
faced in the development of the Fire Control Center for 
the PLATO antimissile system. In addition to the re- 
quirement of handling a wide variation in traffic rate in 
real time, a Fire Control computing system must also 
be small and mobile. The program design which was de- 
veloped illustrates one way in which a high-speed com- 
puter may be used efficiently for such a problem and is 
general enough to be applied to other real-time problems 
such as transportation or communications systems con- 
trol. 

The basic property of a real-time system is that cer- 
tain commitments must be met at particular times no 
matter what the load on the system. The method of ap- 
proach considered here uses as a design base a timing 
interval defined as the largest interval in which the 
essential outputs are required no more than once. The 
size of this interval, as a consequence of this definition, 
is determined by the external dynamic and physical en- 
vironment, and is not dependent on the computer being 
used. 

In addition to the essential set of outputs produced 
during a basic timing interval, there are also outputs, in 
most real-time systems, whose computation may be de- 
layed to a later time. In the Fire Control application 
such tasks as predicting target position or selecting 
launch sites for a defensive missile could be delayed, 
whereas computing a command for a defensive missile 
which is airborne may not be delayed. In order to reduce 
computing time when an overload is present, two ave- 
nues of approach are available. (See Fig. 1.) First, the 
computation of the quantities which are not essential 
during the present interval can be delayed. However, 
before this is done, the expected load in the future, when 

f Missile Systems Lab., Sylvania Elec. Products, Inc., Waltham, 
Mass. 


the output of this computation can be delayed no longer, 
will have to be considered. The second alternative would 
be a change in the method by which the essential out- 
puts are formed ; that is, a sacrifice of quality for quan- 
tity. The high speed at which computers work makes it 
impossible for human intervention to control the selec- 
tion of what is processed and the manner in which it is 
processed. Therefore, it is necessary for the computer to 
determine what it should do and how it should do it by 
examining not only present conditions but also pre- 
dicted future conditions. 


1) Delay of nonessential computation. 

2) Sacrifice of quality for quantity. 


Fig. 1 — -Methods for reducing computing time. 

A program for providing this type of control for a 
large system would soon become a logical monster offer- 
ing little flexibility of operation and an almost impossi- 
ble debugging job unless some type of division of re- 
sponsibility can be obtained. The basic principle of de- 
sign which was applied is a division of the system into 
sections, each of which has a control program associ- 
ated with it. These sections are connected and con- 
trolled by a central control program which imposes a 
given strategy on the system by the manner in which it 
connects the sections of the system. One outstanding 
advantage of this type of program design is the flexibil- 
ity inherent in the control programs as well as in the 
operational programs. Also, this design enables several 
sections of the system to be programmed concurrently 
by different programmers without the worry of incom- 
patible results. 

In the design of the PLATO Fire Control Center, con- 
sideration was first given to defining levels of decision 
which provide a basis for proper division of the system 
into sections. This was followed by a detailed design of 
the different sections and their associated data-storage 
design. Of course, during this design period full consid- 
eration was given to possible orderings of simulation ex- 
periments for the efficient analysis of the system. 

Division of the System by Decision Levels 

The system recognizes three levels of decisions: those 
appearing in central control, subcontrol, and system 
significant blocks. (See Fig. 2.) The basic unit of the 
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system is the system significant block in which all opera- 
tional programs appear. Associated with each of these 
blocks is a subcontrol which controls its operations. The 
highest level, central control, controls the relationships 
between subcontrol-system significant block combina- 
tions. 

The properties which a system significant block 
should satisfy are: 

1) Any decision made in the system significant block 
must be a decision directly dependent on output 
within that system significant block, and must 
directly control the operation within that block. 

2) Any functional change in the system significant 
block should not affect the rest of the system. 

3) Any functional change in the rest of the system 
should not affect the system significant block. 

4) Any section of the system significant block which 
satisfies the above three conditions must not be 
directly controlled by a timing decision (a deci- 
sion determining how time is used). 

Therefore, a system significant block becomes a black 
box provided with inputs which in turn generates out- 
puts. The factor determining how the system is divided 
into system significant blocks is that no decision con- 
trolling operation of the rest of the system nor timing 
decision can be included within a block. Also, as the 
name implies, the system significant block should repre- 
sent an operation which is meaningful to the system as 



Fig. 2 — Division of system by decision levels. 


a whole. For example, a Fire Control Center which must 
interpret data from a radar and, on the basis of these 
data, launch and guide a defensive missile to intercep- 
tion of an enemy missile, might be divided into systems 
significant blocks in the following manner. (See Fig. 3.) 

1) A block which detects the presence of a target by 
examining the data received from the radar. 

2) A block which smooths the target data and pre- 
dicts the position of the target. 

3) A block which predicts the position of the defen- 
sive missile. 

4) A block which computes commands which are sent 
to the defensive missile. 

In any computing system, the juggling of the ordering 
of decision can make it possible for almost any block to 
satisfy the properties of a system significant block. 
Thus, one would begin the division of a system by de- 
termining the operations present which actually possess 
significance to the system as a whole. Then the blocks 
performing these operations can be designed according 
to the four properties already listed. 

Each system significant block will have associated 
with it a subcontrol block which controls its operation. 
A subcontrol block is composed of decision operations 
which are directly dependent on the output of a central 
control and/or one or more subcontrol blocks, and the 
result of these decision operations directly control the 
operation of one system significant block. This inter- 
mediate level of control associated with each system 
significant block provides a division of control into sub- 
sections in the same manner as the system significant 
block division divided the operational program into 
more manageable subsections. The subcontrol block has 
the responsibility for making all decisions, time-de- 
pendent or otherwise, which affect the operation of its 
system significant block, that is, affect the method in 
which the block is used. By way of illustration, the sub- 
control of a system significant block which predicts fu- 
ture target position might be required to determine if it 
has been allotted enough time to predict the position of 
all targets which have been detected by the best mathe- 
matical method. When there is not enough time, the 



Fig. 3— Example of system significant block breakdown of a Fire Control Center. 
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positions of some targets will have to be predicted using 
a less accurate but shorter method ; and the subcontrol 
will have to determine which targets should be processed 
using the shorter method. 

The highest decision level is the central control which 
contains all decision operations directly concerning 
strategy. A decision at this level is directly dependent 
on the output of one or more subcontrol blocks and di- 
rectly controls the operations of one or more subcontrol 
blocks. One important reason for separating decisions of 
this type from those appearing in the subcontrol blocks 
is that changes in strategy could be frequently required 
by a changing tactical situation. 

Unit Design 

The properties already discussed, which are associated 
with each of the three levels of the system, set the 
bounds for the detailed design of each level. 

The structure of the system significant block, which 
is the basic unit of the system, is a collection of calling 
sequences. (See Fig. 4.) These calling sequences may be 
considered as macro instructions from which a program 
is formed for carrying out a designated task. The normal 
return of each macro instruction contains an uncondi- 
tional transfer instruction; the addresses of these trans- 
fer instructions are inserted by the subcontrol block of 
the system significant block. This insertion of addresses, 
accompanied by the storing or changing of parameters 
in the macro instructions where needed, is the only 
operation necessary to form a program. Hence, the sub- 
control block effectively writes a program by inserting 
transfer addresses and parameters in macro instruction. 
However, in order to accomplish this the subcontrol 
must determine on the basis of data supplied by central 
control what tasks are waiting to be done, in what order 
they should be done, and how they should be done. 


Provision must also be made for an overload of work, 
that is, a selective process for determining what tasks 
should be delayed until a later time. In order to do this, 
information such as present work load, time available 
for the subcontrol and system significant block com- 
bination being used, priorities of tasks waiting to be 
done, and any restrictions placed on waiting tasks must 
be available. (See Fig. 5.) 

The first function of the subcontrol will be to select 
a task. This selection will be predominately based on a 


Enter Subroutine 1 

Parameters and 
Data Storage 
Locations 

Normal Return* 

Enter Subroutine 2 

Parameters and 
Data Storage 
Locations 

Normal Return* 

Enter Subroutine 3 

Parameters and 
Data Storage 
Locations 

Normal Return* 

Enter Subroutine n 

Parameters and 
Data Storage 
Locations 

Normal Return* 



Calling Sequences 


* Normal returns are transfer instructions whose addresses are 
supplied by subcontrol. 

Fig. 4 — Structure of a system significant block. 


Enter Subcontrol 



Fig. 5 — Structure of a subcontrol block. 
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comparison of priority to time available; however, work 
load could also be a factor. Once a task has been se- 
lected, it is necessary to determine how this task should 
be carried out. There are actually two types of selec- 
tions which must be made at this point: a selection of 
method based on the same quantities on which selec- 
tions of tasks were based, and a selection which is com- 
pletely independent of timing requirements but based 
on the state of a given task. The first type of selection 
is concerned with quantity vs quality. In other words, 
as demands on time increase, accuracy may be sacrificed 
in order to increase the number of outputs computed 
during a computational interval. This type of decision 
must be based on parameters computed by the central 
control since it is concerned with the strategy being 
employed. The second type of decision is concerned 
with how much has already been done on a certain task 
and on the basis of past results what should be done 
next. During initial simulations there will be the need 
for the analyst to select arbitrarily a given method he 
wishes to evaluate. For this purpose a manual selec- 
tion switch not under control of the subcontrol is 
included. 

One final duty of the subcontrol is to place all inputs 
and outputs of the system significant block on tape in 
the binary mode if they are desired for historical records 
and use in future analysis of system operation. 

The central control block is only entered at the be- 
ginning of each basic timing interval. (See Fig. 6.) At 
this time, the subcontrol-system significant combina- 
tions which are to be used, the order in which they 
should be used, and the amount of time allotted to each, 
is determined. In order to accomplish this, the central 
control must be able to assess the work load expected 
during this and succeeding intervals, and the relative 
importance of each task to be performed by the system 
significant blocks. This type of information would be 
based on such quantities as probability of a task obtain- 
ing its objective, the size of the waiting line for a given 
task, time required by each system significant block, and 
necessary data rates allotted to a given task. It would 
also be the duty of central control to interpret external 
commands which have been supplied since the last in- 
terval. This would include information regarding func- 
tioning of different sections of the system, changes in 


strategy, and instructions concerning the deletion of 
certain data. 

Such things as error checks, system performance 
checks, and diagnostic programs will be handled by 
central control or at least controlled by central control. 
Signals indicating errors either in the computers or in 
other equipment such as data links will be interpreted 
by central control and the necessary action taken. This 
action would include such things as bypassing computa- 
tions which cannot be handled with a given part of the 
system nonoperable, the switching-in of spare equip- 
ment, the calling-in of diagnostic programs to help lo- 
cate the cause of an error, and the issuance of corrective 
instructions, if possible, where incorrect outputs have 
been given. System performance checks would indicate 
the way the system is operating and could be used to 
determine if an alternative strategy is required. Some 
examples of the types of decisions which central control 
may have to make are: 

1) An external signal indicates that one launch site 
cannot be used. What action should be taken? The 
block which performs launch site selection would 
have to be notified ; also, time must be allotted dur- 
ing the next few timing intervals for reassignment 
of launch sites for any interception already planned 
using the inoperative site. 

2) There is a large number of targets requiring pre- 
diction and smoothing. How should the next time 
interval be used in order to reduce the size of this 
waiting line? During the next interval, as much 
time as possible would have to be allotted for the 
prediction system significant block. Hence, the 
other blocks would be allotted a minimum amount 
of time or bypass completely. The parameters sent 
to the prediction subcontrol would be such that 
all prediction would be done by the shortest 
method. 

The exact structure of the central control cannot be 
specified in detail since the functions it must carry out 
are dependent on strategy. However, ideally, its func- 
tion is to apply a strategy to the present conditions in 
order to produce a course of action which is consistent 
with immediate operational requirements and results in 
the least probability of system saturation. 


Representative Inputs 
Size of Waiting Lines 

Data Rates 

Error Alarms 

Data from Control Center 

System Performance Results 
Fig. 


Central 

Control 

Strategy 


Representative Results 
Ordering and Time Allotments 
•for SS Blocks During Next Interval 

Parameter Changes 

> 

External Outputs to Monitors 


6 — Functions of central control. 
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Subroutine Design 

The macro instructions which compose system signifi- 
cant block have been described as calling sequences to 
subroutines. (See Fig. 7.) These subroutines are divided 
into three levels according to their use in the system, 
and hence the manner in which they must be written. 
The calling sequences of those in the top level only ap- 
pear in system significant blocks and usually in only 
one block. The function performed by this type of sub- 
routine is usually system dependent; that is, the func- 
tion could be done in many ways and simulation and anal- 
ysis will determine which method is best for a given 
mode of operation. Calling sequences of subroutines 
from lower levels could appear within a subroutine at 
this level. The calling sequences of a second level sub- 
routine could appear in a system significant block or in 
a top level subroutine. The functions performed by a 
second level subroutine are explicitly defined mathe- 
matical or logical tasks and their use does not directly 
depend on mode of operation of the system. This level 
subroutine can only contain calling sequences of sub- 
routines from the lowest level. 

All subroutines in the top two levels are written in 
closed form and the location of all inputs and outputs 
are specified in the calling sequence. Provision is made 
within each subroutine for storing all inputs and outputs 
on tape under the control of its calling sequence so that 
any data needed for analysis purposes are readily ob- 
tained if desired. One of the differences between these 
two levels of subroutines is the way in which they are 
written; that is, the second level programs are com- 


Data-Storage Design 

Another requirement of program design is the specifi- 
cation of the data storage. The data storage for a real- 
time system must be set up with the specific purposes of 
the system in mind and be compatible with the program 
design. Output requirements are one of the important 
considerations in designing data storage. System signifi- 
cant blocks and subroutines on the top two levels have 
provision for placing their inputs and outputs on tape. 
The data supplied by individual subroutines are neces- 
sary for analysis of the programs involved, whereas 
once the system is actually in operation, the output sup- 
plied by the subcontrol blocks is of prime importance. 
Since, in many cases, there is a minimum of time avail- 
able for output in an operating system, the data-storage 
design should be such that the system significant block 
outputs require a minimum amount of computational 
time. 

Even after a system is operating, it may be neces- 
sary to change or modify the strategy being employed. 
In order to expedite this type of change, separate stor- 
age for control data is advisable. In systems where ran- 
dom-access memory is limited, data not being used 
would be transferred to some auxiliary type storage such 
as drums. The data storage in this case would have to 
be designed with this transference of data in mind. 

Simulation Experiments 

The first units of the system to be simulated are the 
subroutines. (See Fig. 8.) The purpose of this phase of 
simulation is to evaluate the mathematical models used 


Level 

Use 

Form 

Location of Calling 
Sequences 

Subroutines Used by 
a Subroutine 

Tape Output 
Provided 

Error 

Returns 

1 

Dependent 
on Mode of 

System Operation 

Specialized 

ss 

Levels 2 or 3 

Yes 

Yes 

2 

Not Dependent 
on Mode of 

System Operation 

General 

SS or Level 1 

Level 3 

Yes 

Yes 

3 

Utility Programs 

General 

Levels 1 or 2 

None 

No 

Yes 


Fig. 7 — Subroutine levels. 


pletely general, whereas top level programs can be more 
specialized. 

The third level of subroutines consists of utility pro- 
grams, and the calling sequences of this type of program 
appear only within other subroutines of higher levels. 
All utility programs are written in a general closed form, 
and no provision for tape storage of inputs and outputs 
is provided. 

Subroutines from all levels have error returns for over- 
flow, division failure, and nonacceptable inputs. 


Phase 


I /a) Subroutine. 

\b) Groups of subroutines. 


( a) System significant block with all or part of its 
subcontrol. 

b) Groups of system significant blocks each with 
its subcontrol and part of central control. 


Phase III Complete system including peripheral equip- 
ment. 


Fig. 8 — Ordering of simulation experiments. 
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over the range of inputs expected. In some cases, this 
simulation consists of connecting several subroutines 
together. The second phase consists of system signifi- 
cant block simulation accompanied by subcontrol con- 
trol. During this phase, the different methods available 
for performing a function are assessed and the optimum 
selected. More than one method per function is kept in 
some cases because of time considerations, e.g., time 
available vs accuracy needed. Sections of the control pro- 
gram used during this phase become part of the sub- 
control of the blocks being simulated. The third phase, 
the entire simulation, is reached by successively com- 
bining system significant blocks and adding the appro- 
priate sections of the central control. Adequate evalua- 
tion of programs at any level is impractical without 
realistic inputs which are most easily supplied by other 
subcontrol system significant block combinations and 


central control; thus complete system simulation seems 
necessary. Also, a synthetic environment would have 
to be provided by peripheral subsystem simulation of 
such things as radar and missile. 

Conclusion 

For the PLATO antimissile system Fire Control 
Center, a program design was developed which embodies 
three decision levels: central control, subcontrol, and sys- 
tem significant block. This versatile program design pro- 
vides for the maximum machine utilization, and permits 
a trade-off between quality and quantity of computa- 
tion as well as expediting programming, debugging, and 
simulation. 

The general concepts of this program-design ap- 
proach should be applicable to other real-time comput- 
ing systems in the business and scientific fields. 


Pattern and Character Recognition Systems — Picture 
Processing by Nets of Neuron-Like Elements 

L. A. KAMENTSKYf 


Introduction 

S PATIAL pattern recognition, of which the recog - 
nition of alpha numeric characters is a subclass, is 
an important and practical problem. More efficient 
coding of transmitted pictorial information and more 
efficient utilization of humanly produced information 
could result from its solution. The problem of pattern 
recognition has been stated as the assignment of a mean- 
ingful code to a recognizable structure in a set of sig- 
nals. 1 The signals, in this case organized spatially, are 
the result of a transformation from a visual picture field 
P to an electrical representation of this field. The points 
of this signal field 5 correspond to a characteristic of 
points in the picture. In this study, the reflectivity of 
given picture-point areas is quantized as black or white 
as a basis for a two-state electrical-signal representation 
of points of a pattern. 

The total information content of this signal field is 
certainly less than that of the original picture. Useful 
pattern recognition requires that the code assigned to a 
pattern have even less information content than the 
signal field. For example, a code assigned to a pattern 
of a number in the set of 10-decimat-number symbols 

f Bell Telephone Labs., Inc., Murray Hill, N. J. 

1 L. D. Harmon, “Computer simulation of pattern -recognition,” 
presented at Symp. on Pattern Recognition, Ann Arbor, Mich. ; 
October 22, 1957. 


indicates the number value but need not reflect the size, 
position, nor quality of the original pattern. The pat- 
tern-recognition machine is thus a device for performing 
an information-destructive transformation on the signal 
field to yield an output code assigned to the value of 
the pattern. Internally, the machine may perform a 
sequence of information-destructive transformations. 
Furthermore, the machine may internally produce 
transformations yielding parameters of the picture as an 
intermediate step, rather than the output code directly. 
Such parameters may have a physical meaning. For 
example, in useful number recognition the presence of 
the pattern parameters, straight lines, openings in 
curves lines, or corners is significant for generating the 
output code. 

This paper describes an approach to the solution of 
pattern recognition that may be characterized as 
“spatial operations by neuron-like elements.” Signal 
fields are transformed by a predetermined network of 
threshold-responsive elements. These elements have 
been called neuron-like in that they have many inputs 
and a single all-or-nothing output; they are connected 
in spatial arrays with excitation or inhibition gating 
between the signal field and the elements, or between 
the output and inputs of elements. More correctly, they 
should be called spatial neurons, since only their spatial 
properties approach the assumed properties of neurons. 
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(The coined name "speuron” will be used in this paper.) 
As will be shown, spatial gating only has been used and 
none of the complicated temporal properties of neurons 
have been simulated. However, speurons have really 
been applied for their usefulness in performing many 
different information-destructive transformations. They 
operate on all signal-field points simultaneously under 
the centralized control of a program. A model of a 
simplified speuron net was built. Included in this paper 
are illustrations, from the output of this model, of useful 
pattern filtering (noise reduction, width reduction, etc.) 
and pattern-parameter (straight lines, corners, etc.) ex- 
traction transformations. 

Classification of Methods 

Solutions of the pattern-recognition problem may be 
classified by the nature of the required information- 
destructive transformations. Remembering that the in- 
put to a pattern recognizer is the signal field defined in 
the first paragraph, and that the output is a code as- 
signed to represent a class of patterns having a recogniz- 
able structure, let us develop the place of the neuron- 
like net. 


lowing the contours of a pattern and changes in direction 
of a tracing head can be used as the features of the pat- 
tern. In recognition of double-dot writing, 5 searches are 
made for written lines in seven specific areas of the signal 
field. In effect this searching system codes the feature, 
"Are partially- or fully-closed loops of black area present 
in this pattern?” 

Feature Extraction by Spatial Transformations 

The signal field is usually resolved into n independent 
elements. During a spatial transformation, the state of 
each element is examined. The state of this element and 
other elements whose coordinates are specified with re- 
spect to the examined one are functionally related by a 
specific rule to determine the transformed state of the 
examined element. There are many rules for trans- 
forming patterns. 6-9 A study of Fig. 1 may help to clar- 
ify this. Three such transformations are shown here. 
Consider neighbors in this case as adjacent horizontal- 
vertical elements. 

Although one investigator 9 has proposed a parallel 
transformation system, all experimental studies of 
spatial transformations have been made using a digital 


Element Matching 

The signal field is resolved into n independent ele- 
ments and each input pattern is represented by an n bit 
code. Recognition is effected by matching an input code 
with codes representing the configurations of independ- 
ent elements of each of the set of recognizable patterns. 
All possible patterns would be represented by a code 
table containing <2 n entries. Logical 2 or statistical 
techniques 3 may be used to find the correct or best fit 
of the input and recognizable pattern fields. 


INPUT 



PATTERN 


Feature Matching 

The individual elements of many patterns, however, 
are not independent, since recognizable patterns contain 
constraints on form. Parts of patterns can be classified 
in terms of independent groups of elements, instead of 
by individual elements. Some of these groups include 
the geometrical parameters, straight, curved, closed or 
opened, and breaks or corners. These may be independ- 
ent of absolute position, size, noise, and some changes 
of form. We shall call these "features of the pattern.” 
Recognition of patterns is possible if a sufficient set of 
relevant features can be extracted from the signal field. 

Searching for Features 

This group of pattern recognizers effects feature- 
extracting transformations by searching specific areas 
of the signal field. Edge tracing 4 can be applied to fol- 


2 “Electronic Reading Automation,” The Solartron Electronics 
Group, Ltd. 

3 H. T. Glantz, “On the recognition of information with a digital 
computer,” J. Assoc. Comp. Mach., vol. 4, pp. 178-188; 1957. 

4 J. Loeb, “Communication theory of transmission of simple 
drawings,” in “Communication Theory,” Willis Jackson, ed., But- 
terworth Scientific Pubis., London, Eng., pp. 323-325; 1953. 



S IS THE POINT UNDER EXAMINATION AND HAS VALUE ONE 
IF BLACK 

N IS THE NUMBER OF ITS NEAREST NEIGHBORS THAT 
ARE BLACK 

Fig. 1 — Different transformations of the above pattern. 


5 T. L. Dimond, “Devices for reading handwritten characters,” 
Proc. EJCC, pp. 232-237; 1957. 

6 O. G. Selfridge, “Pattern recognition and modern computers”, 
Proc. WJCC, pp. 91-93; 1955. 

7 G. P. Dinneen, “Programming pattern recognition,” Proc. 
WJCC, pp. 94-100; 1955. 

8 L. Cahn, R. A. Kirsch, L. C. Ray, and G. Urban, “Experiments 
in processing pictorial information with a digital computer,” Proc. 
EJCC, pp. 221-229; 1957. 

9 S. H. Unger, “A new type of computer oriented toward spatial 
problems,” Proc. WJCC ; 1958. 
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computer. Each point must be examined sequentially 
until the complete field has been transformed. Although 
all spatial transformations to be described can be done 
on a digital computer, a relatively long time is required 
to relate spatial arrays of signals. As a result, parallel 
operations have been thought essential to performing 
useful pattern recognition. 1 - 10 

Spatial Computers 

Unger 9 has proposed a program-controlled computer 
consisting of a spatial array of identical arithmetic 
modules which are connected to adjacent modules and 
to corresponding signal field points. Each module con- 
sists of a one-bit accumulator, about six bits of memory, 
and associated logic. The logical arithmetic operations 
this system could perform have been discussed by Unger. 

The utility of serial computing has been questioned 
and has led to proposals for spatial computing. It is 
proposed that the utility of doing arithmetic operations 
at all be questioned as an answer to the pattern- 
recognition problem. Threshold-responsive elements 
(speurons) are proposed as one other answer, because : 

1) They are simple; a model was constructed using a 
transistor per speuron. 

2) They can be connected in different ways; each ele- 
ment can have many inputs; the associated logic 
between connections is simple. 

3) They can transform by many different rules with 
a given connection by changing a common supply 
voltage. 

4) They can be used to recognize patterns by succes- 
sively transforming a signal field under program 
control. 

It is demonstrated here that certain speuron operations 
will clean up patterns or extract certain features; that 
is, an output state or given sequence of states will ap- 
pear, after a given sequence of transformations, only at 
points in the signal field where a certain feature is pres- 
ent. It is hoped that speuron subroutines will be found 
to uniquely extract sufficient sets of features of useful 
sets of patterns. One subroutine for each feature will be 
run sequentially on the machine. The presence or ab- 
sence of a given set of relevant features will be the basis 
for recognizing patterns, like numbers or letters. 

Neuron-Like Elements 

The basic element of the neuron-like nets is shown in 
Fig. 2. The general elements Nj may have any num- 
ber n of inputs Xij each taking on the value 0, +1, or 
— 1. All elements are controlled by setting a “thresh- 
old ZJ Z can take on the values 0 to n. The elements 
have a single output Yj. Yj is either 0 or 1 based on the 
criteria : 

10 O. G. Selfridge, “Computers and pattern recognition, ” presented 
at meeting of Amer. Assoc. Advance Sci. ; December 27, 1956. 


Xij*0,+ l,-| 


Y*0, 1 


INPUT 



THRESHOLD 

CONTROL 


Yj » I IF S Xjj i Z 
‘ i*l 


Yj «OIF X Xjj <2 
i*l 


Fig. 2 — The basic neuron-like element. 


Yj =1 if E Xij > z 

i= 1 
n 

Yj = 0 if 2 Xij < Z. 

In the general case, Xij and Yj are different for each 
element, but Z is the same for all elements of a given net . 

Input to and Output from the Elements 

Other logical elements must be connected to the basic 
element to provide for successive transformations and 
to increase the number of operations of a speuron net. 
A general form for the input and output of speurons is 
shown in Fig. 3. The inputs Iij used to control gates can 
take on the value 0 or 1. The gated signals Ci, the con- 
trol inputs to the speurons, can take on the values 0, 
+ 1, or —1. Ci is not necessarily equal to C 2 or C 3 , etc.; 
however, all Ci’s, C 2 ’s, etc., all Z ’ s and flip-flop reset 
signals R are connected together. Xij may be related to 
Ci and Iij by the following table: 


Ci 

Iij 

Xij 

— 

0 

0 

+1 

+ 1 

+1 

-1 

+ 1 

-1 

0 

i 

+ 1 1 

0 


Net Connections 

Two sets of net connections have been studied. In 
the first, called the nearest neighbor connection, a field 
point and neighbors surrounding it are connected to a 
corresponding speuron. In the second, called the di- 
rected connection, a field point and points along specific 
radii emanating from this point are connected to a cor- 
responding speuron. 
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Nearest Neighbors Connections 

Each of nine inputs Iij of a threshold-responsive ele- 
ment Dj is connected to a point in the signal field Sj cor- 
responding to the position of Dj and to one of its eight 
nearest neighbors. This connectivity may be extended 
to next-nearest neighbors, etc. In a simplification of this 
connection, shown in Fig. 4, each of five inputs Iij to 
speuron Dj is connected to Sj and to one of its four hori- 
zontal-vertical nearest neighbors. The latter connection 
was used in the model to be described. It should be 
pointed out that neighboring and corresponding input 
points add or subtract in a manner depending on the 
values of Ci. To effect successive transformations, the 
output of each element Oj is connected back to the cor- 
responding input Sj through a gate which is opened for 
each transformation cycle. It is assumed that an input 
signal Sj can change to the same state as Oj after each 
transformation cycle. 

Directed Connection 

The inputs of an element Dj are connected to the 
corresponding signal point Sj and to signal points along 
radii emanating from Sj. A directed connection with 
horizontal- vertical radii is shown in Fig. 5. A range of 
influence and the number and the angles of the radii 
must be defined for each net. The directions of influ- 
ence on all speurons are controlled by the voltages Ci. 
There is one common control for each radius and one 
control for the connection to the corresponding field 
point. Transformations can be generated which extract 
the feature, “Does a signal field point lie within a par- 
tially- or completely-closed loop of black area, or is it 
part of a black area?” Settings of Z will produce trans- 
formations which depend on the width of pattern lines 
or on the number of pattern lines in a given direction. 

A Simple Model 

A 30-element model neuron-like net was built to 
study the feasibility of this device and to allow the au- 


SI6NAL FIELO 




thor to experiment with different transformations. Al- 
though all transformations can be simulated on a com- 
puter, the use of the model enabled the transformations 
to be evaluated faster. 

The model’s input signal-field toggle switches, 
neuron-like elements, and lamp outputs were arranged 
in a six-by-five matrix. The input-toggle switches were 
each connected to four leads with phone-tip ends to al- 
low any input to be connected to the input jacks of any 
speurons. Each element is a Kirchhoff adder driving a 
biased transistor. A circuit diagram of five elements and 
their input-output is shown in Fig. 6. These correspond 
to the simple Nj elements shown in Fig. 2, except for one 
input which can be controlled (as in Fig. 3) to be made 
positive, negative, or zero if the signal point connected 
to that input is one.|This input was always connected to 
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OUTPUT LIGHTS 
033 


INPUT 

SWITCHES 

133 



-67.5 V 

6SELF POS H4 
•SELF ZERO 
fSELF NEG 


— 6V 


+67 5 V 


Fig. 6 — Circuit diagram of 5 model elements with 
nearest neighbor connection. 


the signal pattern position corresponding to the neuron- 
like element itself. This resulted in positive summing in- 
puts from the four neighboring field points and adding, 
subtracting, or zero inputs from the corresponding or 
self-position. Four Z settings were built into the model. 
With the three self-input settings, 12 operations are 
possible. A lamp was used to indicate if the output pat- 
tern of each element was a one. Either the input or out- 
put pattern can be displayed on the lamp array. A pho- 
tograph of the model is shown in Fig. 7. 

Successive transformations were made by noting the 
pattern of output lights and setting up this pattern on 
the toggle-switch input array. The model was connected 
using both nearest neighbor and directed connections. 
The model successfully indicated useful nearest neigh- 
bor connection transformations. It was found that 
there were not enough elements to do anything useful 
with the directed connection. Transformations by the 
model are illustrated in the next section. 

Model's Nearest Neighbor Transformations 

Illustrations from the model’s display of its 12 opera- 
tions are shown in Fig. 8. To avoid confusion a different 
input pattern was used to illustrate each threshold set- 
ting. In the first column are the input patterns. In the 
second column are the patterns obtained with the posi- 
tive self-connection; the third column shows zero self- 
connections; the fourth column illustrates negative self- 


connections. The four threshold settings of Z— 1 , 2, 3, 
and 4 are shown on rows 1, 2, 3, and 4. Note that the 
transformations in the upper left fill in patterns, while 
those in the lower right reduce patterns. If all of the in- 
puts were controlled as in Fig. 3, the Ci controls would 
make the transformations directionally selective among 
15 alternatives. 

Some basic picture-processing operations are shown 
in Fig. 9. The results are self-explanatory. The values of 
Z are the threshold setting ; the values of S' are E for an 
adding or self-exciting connection, 0 for a self-zero or 
ignoring connection, and I for a subtracting or self- 
inhibiting connection. If additional controls Ci were 
used, patttern thinning and corner finding could be 
made directional operations. For example, only lower- 
left corners could be indicated on the output. 

Fig. 10 illustrates operations on horizontal, vertical, 
and diagonal lines, vees and tees. Transformations are 
possible which reduce vees or tees to a single point. Some 
of the other patterns are interesting, especially the ones 
showing the invariance of the vee to the self-settings. 
The lower four photographs show the behavior of iso- 
lated lines to the transformation with Z = 2 and S—I. 

Successive transformations with Z — 2 and 5 = 0 are 
shown in Fig. 11. In one case, that of a nonsquare “L,” 
successive transformations result in an alternation of 
two patterns after a number of transformations, the 
number depending on the pattern size. For a square 
“L,” the resulting pattern is stable and a filled square. 

Conclusion 

A classification of the character-recognition problem 
was used to show the distinctions among the various 
methods. The neuron-like net method gives a large 
class of spatial operations, transforming all input pat- 
tern points simultaneously. 

The neuron-like elements are relatively simple but 
capable of many operations. All speurons are con- 
nected alike to elements of an input presentation and 
are centrally controlled. Two distinct types of net con- 
nection have been studied. The first related neighboring 
points of the pattern; the second related points along 
given radii in the pattern. 

A simple model has been constructed and shown to 
yield useful transformations which reduce irregularities 
and extract pattern features. Transformations obtain- 
able by other types of elements and net connections 
have been indicated. A number-recognition method 
using one connection is currently being evaluated. The 
neuron-like net will be investigated more extensively by 
simulation on the IBM 704 with input patterns pro- 
duced by a scanner. 11 

11 W. H. Highleyman and L. A. Kamentsky, “A generalized 
scanner for character and pattern recognition studies,” this issue, 
p. 294. 
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-Successive transformations of a nonsquare 
and square “L” with Z — 2, 5 = 0. 


Fig. 10 — Picture-processing operations. 
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The Social Responsibility of Engineers and Scientists 

F. B. WOODf 


Introduction 

R ECENTLY there has been some interest in the 
question of the social responsibility of engineers. 
" A series of articles and letters to the editor ap- 
peared in the early part of 1958 in Computers and Auto- 
mation' 1-6 which dealt first with whether a journal such 
as Computers and Automation should publish articles on 
the social responsibility of computer scientists. Then 
specific topics such as the possibility of the destruction 
of civilization due to some component failure in the com- 
puter linked to a missile-warning radar network were 
treated. A series of viewpoints has been presented rang- 
ing from conscientious objection to working on a com- 
puter system that might be used for destructive pur- 
poses at one end of the scale, to a viewpoint of no con- 
cern with the use of one’s work at the other end. My 
interpretation of these discussions is that people are 
arguing about the implied hypothesis: there is a danger 
to the existence of our civilization because social institu- 
tions have too long a time lag in making adjustments to 
utilize the latest technological advances wisely. 

This apparently sudden interest in the social responsi- 
bility of computer scientists was preceded by a long and 
fluctuating development of concern for social responsi- 
bility in science and engineering. Meier has reviewed the 
status of social consequences of scientific discovery and 
has made specific recommendations concerning the so- 
cial responsibility of administrative scientists. 7 Layton 
has studied the history of the idea of social responsibility 
in the American engineering profession. 8 Rothstein has 
discussed some of the deeper philosophical aspects of 
these problems in his book. 9 The Western Joint Com- 


f IBM Corp., San Jose, Calif. 

1 Readers and Editor’s Forum, “Curse or blessing?” Computers 
and Automation, vol. 7, pp. 9-10; January, 1958. 

2 E. C. Berkeley, “Cooperation in horror,” Computers and Auto- 
mation, vol. 7, p. 3; February, 1958. 

3 A. A. Burke (I), W. H. Pickering (II), and Editor (III), “De- 
struction of civilized existence by automatic computing controls,” 
Computers and Automation, vol. 7, pp. 13-14; March, 1958. 

L. Sutro, “Comments on ‘Destruction of civilized existence by 
automatic computing controls,’ ” vol. 7, pp. 6, 31; May, 1958. 

4 Editor (I, III) and Readers (II), “The social responsibility of 
computer scientists,” Computers and Automation, vol. 7, pp. 6, 9; 
April, 1958. 

5 “Ballot on discussion of social responsibility of computer sci- 
entists,” Computers and Automation, vol. 7, p. 6; May, 1958. 

Later results, vol. 7, p. 6; July, 1958. 

6 N. Macdonald, “An attempt to apply logic and common sense 
to the social responsibility of computer scientists,” Computers and 
Automation, vol. 7, pp. 22-29; May, 1958. 

Discussion: “Locks for front doors,” vol. 7, p. 24; August, 1958. 

7 R. L. Meier, “Analysis of the social consequences of scientific 
discovery,” Amer. J. Phys., vol. 25, pp. 609-613; December, 1957. 

8 E. Layton, “The American engineering profession and the idea 
of social responsibility,” Ph.D. dissertation, Univ. of Calif, at Los 
Angeles; December, 1956. 

9 J. Rothstein, “Communication, Organization and Science,” 
The Falcon’s Wing Press, Indian Hills, Colo.; 1958. 


puter Conference at Los Angeles, Calif., May 6, 1958, 
conducted a panel on “The Social Problems of Automa- 
tion.” 10 

The various viewpoints appearing in Computers and 
Automation present an uncoordinated distribution of dif- 
fering ideas. The views of the 1958 WJCC panel have a 
certain amount of coherence. It would be desirable to 
find a straightforward way for an individual or engineer 
to determine his responsibilities in this area. The ideas 
which I am about to develop are hypotheses brought 
forward for the purpose of obtaining discussion on this 
important subject. At this stage, they represent my own 
personal views and are not to be construed as represent- 
ing a policy of my employer. I would have preferred to 
have this paper follow a historical analysis of this prob- 
lem of the social responsibility of engineers so that I 
could be sure that I am not repeating the same mistakes 
made in previous periods of interest in the subject. Per- 
haps by next year we will have a sounder base to oper- 
ate from in discussing the subject of the social responsi- 
bility of engineers. 

Distinction Between the Social Responsibility of 
Citizens in General and that of Specialists 

In a democratic nation such as the United States, all 
citizens have a responsibility to keep aware of the major 
problems of our country. This is necessary to be pre- 
pared to make wise decisions in electing public officials 
and in voting on basic policies. Specialists such as en- 
gineers and scientists of course share this basic responsi- 
bility with all citizens. I maintain that specialists have 
an additional responsibility beyond that of the citizen 
because of their special knowledge which is not readily 
accessible to the layman. 

What Social Responsibilities do Engineers 
and Scientists Have? 

In the long run, technology is undoubtedly making 
changes in the organization of our society. We cannot 
'expect engineers and physical scientists to become 
sociologists. However, we can expect engineers to ask 
questions and urge that appropriate social scientists 
study the social problems related to their work. Each 
scientist or engineer can ask himself where his own spe- 
cialty fits in the development of devices or new knowl- 
edge which may affect social organization. Then he can 
speculate as to what problems might come up in the fu- 
ture due to the application of his work. 

10 H. T. Larson (chairman), H. D. Lasswell, B. J. Shafer, and 
C. C. Hurd, “The social problems of automation,” panel discussion, 
Proc. WJCC, pp. 7-16; May, 1958. (AIEE Publication T-107.) 
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This may be a further stage in the development of the 
last decade in which it has become popular to use human 
factors engineering studies directed by psychologists to 
determine if proposed electromechanical devices requir- 
ing human reading or manipulation are consistent with 
the way human beings function. As our industrial soci- 
ety becomes more complex, it may be necessary to ex- 
tend this concept to “social factors” studies where the 
engineer calls in sociologists to investigate the social 
effects of applying his new knowledge or devices. 

At this stage the engineer’s responsibility may be to 
see if there is someone or some group studying these 
problems, and if there is not, he can recommend to the 
appropriate agency that such a project be undertaken. 
In this way the engineer can shorten the time lag be- 
tween the introduction of a new technology and the ap- 
preciation of its social consequences. I have noticed that 
even specialists sometimes fail to recognize the division 
point between their domain and that of other specialists. 
The important thing here is to obtain the advice of the 
appropriate specialists, instead of just relying upon our 
own ideas and feelings. 

A Checking Chart to Aid the Engineer in 
Developing Social Responsibility 

Let us construct a chart to outline the factors in- 
volved in determining what an engineer’s social responsi- 
bility should be. Such a chart is shown in Fig. 1. Start- 
ing in the lower left-hand corner, there is a box to write 
in the “Engineer’s Special Work.” To the right appears 
a box for the “New Knowledge and Devices” which may 
result from the work of the engineer. 

The next step is more speculative, namely the listing 
of “Potential Social Consequences” in the third box. 
The next box, “Find Expert Advice,” is for statement 
of the problems that the potential social consequences 
indicate as requiring investigation by social science ad- 
visors. A sample of the principal fields of science ad- 
visors who might be consulted is listed with boxes for 
checking to see if they are needed on this problem. Some 
engineers’ special work may lead to problems of a bio- 
logical or medical nature, while the work of others may 
require the aid of psychologists or social scientists. 

After preliminary contact has been established by the 
engineer with the required science advisors, the engineer 
must determine how far he will go himself in taking 
action. In the box on the right, four different magnitudes 
of action are indicated. The engineer may find all he has 
to do is to inform the appropriate social scientists about 
the problems, and they will pick up the responsibility 
from there on. In other cases there may be no funds to 
support the social scientists, and the engineer may feel 
it is his responsibility to campaign for appropriation of 
funds to support social science projects or to convince 
industrial management to include social scientists on 
their staffs. 

I claim that the engineer, who does not have much 
spare time because of his basic engineering work and 


SCIENCE ADVISORS 



Fig. 1 — A checking chart for analyzing the social 
responsibility of engineers and scientists. 


his family responsibilities, can find short cuts to under- 
standing the social implications of his work through de- 
vices such as the checking chart of Fig. 1. I have faith 
that the engineer can fulfill his social responsibility to 
help utilize the results of his work in keeping with man- 
kind’s highest aspirations. 

To fulfill his social responsibility the engineer must 
understand that it is a responsibility he shares with 
many people both inside and outside his profession. He 
may not need to devote a tremendous amount of time 
and energy to the social implications of his work. The 
key to success lies in developing a fruitful perspective 
of the relationship of his work to the society in which 
he lives. 

A Sample Use of the Checking Chart 

Consider an engineer working on the problems of data 
communication in connecting remote stations to a cen- 
tral computer. This is entered in the first block in Fig. 
2. A successful solution to the data communication 
problem might result in a universal credit system, where 
every store, airline, doctor’s office, race track, stock ex- 
change, etc., would have terminal sets which would 
make transactions when the customer’s coded credit 
card is inserted in the set. This would eliminate the need 
for money for most transactions. This new device is en- 
tered in the second block in Fig. 2. 

Then we go on to block 3, “List Potential Social Con- 
sequences,” such as: 

1) The elimination of money might mean there would 
be no more armed robberies, which would be a step 
forward in the development of civilization. 
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Fig. 2 — A sample use of the checking chart. 


2) The universal credit system might permit a 
shorter working week in sales and business ad- 
ministration work, permitting individuals to de- 
vote more time to creative hobbies which would 
enrich our community life. 

3) New problems might arise such as gangsters in- 
venting a new money system to finance illegal ac- 
tivities. 

4) Police measures instituted to suppress the under- 
world gangsters might interfere with groups work- 
ing on important social problems. For example, 
some public officials might be violating some of the 
provisions of the United States Constitution by 
discriminating against some minority racial or 
religious group. People in the community involved 
might feel like contributing a few dollars each to 
hire a lawyer to look into the case. These people 
might be afraid to contribute to this important 
cause when the accounting system would keep a 
record of each transaction. How do they know 
whether some future official will be able to dis- 
tinguish between supporting a legal test case to 
protect the Constitution, and supporting some 
subversive activities? In such a situation the ex- 
istence of this universal accounting system might 
inhibit people from protecting our constitutional 
government. 

The next step for the engineer is to find expert advice 
to evaluate which of the potential social consequences 
pose real problems that won’t just solve themselves in 


the natural course of events. It would be desirable if we 
engineers could just refer these questions to some agency 
such as the National Science Foundation (NSF) for 
consideration. At present the NSF has a limited repre- 
sentation from the social sciences, so the engineer may 
have to find appropriate experts wherever he can. To 
assist the engineer in finding advice, I have listed some of 
the more obvious classifications on the checking chart 
under the principal categories of biological, psychologi- 
cal, and social science. At present the potential science 
advisors can usually be found on the staffs of nearby 
colleges or research institutes. In this case I have 
checked the boxes opposite the relevant categories in 
this sample case. 

Informal discussion with these expert advisors results 
in a restatement of the problems as follows: 

1) How can we provide protection for individual free- 
dom in a more complex society where new technol- 
ogy such as computer-data communication systems 
permit a centralized accounting system covering 
all financial transactions in the community? 

2) Are there simple legal means and technical char- 
acteristics of a computer-data communication sys- 
tem which permit safeguards to prevent potential 
dictators from seizing control of the system as 
means of gaining control of our country? 

These questions as now restated are questions of im- 
portance to all citizens. The social scientists and the en- 
gineers both have additional responsibilities over and 
above their basic responsibility as citizens. However, the 
citizens at large have the basic responsibility of provid- 
ing for financial support of such studies. The extent to 
which the engineer is responsible for taking action on 
these matters depends upon the state of development of 
social science research projects. 

On the checking chart I have shown four degrees of 
action the engineer might take: 

1) Inform: If through governent agencies or private 
foundations there exist social science research proj- 
ects adequate to study the problems, the engineer 
may discharge his social responsibility by simply 
informing these social scientists about the poten- 
tial technological changes that may result from 
his work. In some cases an engineering research 
organization, in order to protect its proprietary 
interests, may prefer to hire social science con- 
sultants instead of releasing technological data to 
outside institutions. 

2) Discuss: If the social science consultants are avail- 
able and are financed, but do not have sufficient 
understanding of the technology involved, the en- 
gineer may have to organize discussions with the 
social scientist in order to pass enough of his spe- 
cial knowledge on to the people otherwise qualified 
to investigate these problems. 
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3) Propose: If there are insufficient social scientists 
available and the funds available are inadequate 
to support such research, the engineer may find it 
necessary to propose new appropriations and 
scholarships through his company, the existing re- 
search foundations, or through government agen- 
cies. 

4) Campaign: If the agencies having the power to 
allocate funds for the study of these social prob- 
lems fail to act, and the engineer is convinced that 
the problems will soon be urgent, he may have to 
plan stronger action such as campaigning to get 
political groups to pick up the problems. He may 
have to carry his campaign directly to the people, 
if the political leaders are insensitive to his pro- 
posals. 

Maintaining a Perspective 

In his specialized engineering work the engineer has 
acquired through education and experience the por- 
tions of basic science that are most useful in his particu- 
lar engineering assignment. The human needs on his job 
assignment usually have been evaluated by other people 
so that the human needs have already been translated 
into engineering objectives. To fulfill his role as “inter- 
preter of science in terms of human needs,” he needs 
some more direct contact with both science and with 
human needs. He can read such magazines as the Sci- 
entific American , which has popular articles on all levels 
of phenomena, as a way of keeping abreast of develop- 
ments in science. To obtain a more direct contact with 
human needs, he can participate in a local church social 
problems study group. In order to develop a better un- 
derstanding of the business world in which the results 
of his engineering work are used, he can read a magazine 
such as Fortune. He can develop a better perception of 
the social effects of science on a world scale by following 
the activities of the United Nations Educational, Sci- 
entific, and Cultural Organization (UNESCO) by read- 
ing one of their bulletins such as the quarterly Impact of 
Science Upon Society. The technical societies such as the 
AIEE and the IRE might eventually develop a monthly 
one-page abstract of significant articles relating to the 
social consequences of new technology. 


Conclusions 

Recent articles and panels on the social problems of 
computers and automation are a healthy sign that some 
engineers are developing a perspective of how their spe- 
cial field relates to the activities of mankind in general. 
Engineers need some kind of a framework to present an 
abstract but meaningful view of human activity to 
which they can correlate their own work. 

A checking chart has been developed to assist the en- 
gineer in tracing the potential social consequences of his 
own work. A table of major sections of the biological, 
psychological, and social sciences is included to assist 
the engineer in selecting expert advisors. 

In a democracy all citizens have a responsibility to 
keep aware of the major problems of our country. I 
believe that specialists such as engineers and scientists 
have an additional social responsibility because their 
knowledge is not readily accessible to the layman. 

I believe that the engineer can carry out his social re- 
sponsibility primarily by being concerned with the ques- 
tion: Are qualified experts investigating the potential 
social problems that might result from the engineer’s 
work? The engineer can use the checking chart de- 
veloped in this paper to assist in arriving at an answer 
to the question and in determining to what level of ac- 
tion his responsibility should extend. He shares with 
other specialists the responsibility for seeing that these 
problems are being studied and that provisions to inform 
the voters are made in our society. 

I do not suggest that the engineer should be responsi- 
ble for solving the social problems related to his work. 
The engineer’s responsibility is more of a coordinator to 
alert the people of our country to the status of our cov- 
erage of the problems. If the engineer finds that a social 
problem relating to his engineering work is not being 
adequately investigated, he has a responsibility to refer 
questions to management, social scientists, government 
agencies, and to the citizens at large to stimulate the 
investigation of such problems. 
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Emergency Simulation of the Duties of the 
President of the United States 

LOUIS L. SUTROf 


I. Introduction 

A TECHNICAL problem is arising in our demo- 
cratic government which engineers and mathe- 
maticians are equipped to assist in solving. The 
problem is how to approach making the kind of decision 
the President is called upon to make if missiles are de- 
tected on their way toward the United States. The 
number of facts on which a decision should be based 
appears to be increasing. The length of time in which 
to make the decision appears to be getting shorter. 

Dr. Isador Rabi described the problem in a speech 
given in December, 1957. 1 

Hydrogen bombs are going to be deployed at bases around the 
world under the control of many groups of persons. If an oncoming 
ICBM were detected 5000 miles away there might be time to inter- 
cept it with weapons not yet developed. But there will not be time to 
wake up the President to ask what to do, to call a meeting of the 
cabinet. 

Facing a question that has not been mentioned before 
in the literature of computer engineering, we should give 
great consideration to method. I propose that as we ap- 
proach each part of the problem we first describe it in 
the language most appropriate for the topic. Then let us 
attempt to translate this statement into computer and 
control terminology. Third, let us inquire to what extent 
an improved system can be built out of a combination 
of human beings and electronic equipment or electronic 
equipment alone. 

The work of three men is the precedent for the at- 
tempt, in this paper, to describe human beings, human 
relations, and man-machine relations in terms of com- 
puter and control engineering. One is Dr. Warren 
McCulloch, a psychiatrist now at M.I.T., who is de- 
scribing the human nervous system in this manner. One 
of his early papers was, “The Brain as a Computing 
Machine.” 2 One of his more recent is on the design of 
reliable circuits out of unreliable components, 3 giving 
one answer to the question of why the brain is as re- 
liable as it is. The second man to supply precedent is 
Dr. Karl Deutsch, a political scientist now at Yale. He 
came to wide attention with the publication of a book 
explaining nationalism in terms of communication engi- 

t Instrumentation Lab., Dept, of Aeronautics and Astronautics, 
Mass. Inst. Tech., Cambridge, Mass. 

1 R. K. Plumb, “New weapons peril U. S. life, Rabi says,” New 
York Times, vol. 107, pp. 1, 10; January 1, 1958. 

2 W. S. McCulloch, “The brain as a computing machine,” Trans. 
A I EE, vol. 6, pp. 492-497; June, 1949. 

3 W. S. McCulloch, “Stable, reliable and flexible nets of unreliable 
formal neurons,” Res. Lab. of Electronics, M.I.T., Cambridge, Mass., 
Quart. Prog. Rep., pp. 118-129; October, 1958. 


neering. 4 The third is Jay W. Forrester who is now simu- 
lating business and economic systems by computer pro- 
grams. He described his approach in, “Industrial Dy- 
namics — A Major Breakthrough for Decision Makers.” 5 
Prior to undertaking this he directed the development 
of the SAGE computer. I quote these three men exten- 
sively. 

This paper was written during evenings, weekends, 
and holidays. The opinions expressed are mine or those 
whom I quote, and not necessarily those of my em- 
ployer. 

II. The Problem 

We appear to be approaching an era of violence. The 
two major powers are manufacturing weapons to kill 
millions of people. They can be fired by the push of a 
button or by the signal from a computer. Many may 
soon be hidden so that they cannot be destroyed by 
bombing. As these weapons are built, installed, and con- 
nected to remote controls, the probability that one will 
be fired will rise rapidly, and the probability of a salvo 
to wipe out a nation will also rise, although more slowly. 

The problem that engineers need to consider requires 
them to design controls that operate within limits. They 
must so arm the United States that another country 
considering an attack will know that it will receive a 
violent attack in return. Such armament is called de- 
terrent power. On the other hand, they need to be con- 
cerned that building up deterrent power by the United 
States will lead to building up deterrent power by an- 
other country. This interaction is regenerative and leads 
to a rising probability of destruction of both sides. 

The need for deterrent power was presented by Albert 
Wohlstetter in an article entitled, “The Delicate Bal- 
ance of Terror.” 6 Wohlstetter is an economist for the 
Rand Corporation, a private nonprofit research corpora- 
tion working on aspects of national defense and survival. 
He states that: 

We must expect a vast increase in the weight of attack which 
the Soviets can deliver with little warning, and the growth of a signi- 
ficant Russian capability for an essentially warningless attack. . . . 
What can be said, then, as to whether general war is unlikely^ 
Would not a general nuclear war mean “extinction” for the aggressor 

4 K. W. Deutsch, “Nationalism and Social Communication,” John 
Wiley and Sons, New York, N. Y. ; 1953. 

B J. W. Forrester, “Industrial dynamics — a major break-through 
for decision makers,” Harvard Business Rev., vol. 36; July-August, 
1958. 

6 A. Wohlstetter, “The delicate balance of terror, ” Foreign Af- 
fairs, vol. 37, pp. 217, 222; January, 1959. 
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Fig. 1— Examples of channels in the man- machine system for making emergency decisions. 


as well as the defender? “Extinction” is a state that badly needs 
analysis. Russian casualties in World War II were more than 
20,000,000. Yet Russia recovered extremely well from this catas- 
trophe. There are several quite plausible circumstances in the future 
when the Russians might be quite confident of being able to limit 
damage to considerably less than this number — if they make sensible 
strategic choices and we do not. On the other hand, the risks of not 
striking might at some juncture appear very great to the Soviets, 
involving, for example, disastrous defeat in peripheral war, loss of 
key satellites with danger of war spreading — possibly to Russia itself 
— or fear of attack by ourselves. 6 

Wohlstetter concludes that our ability to strike back 
in spite of attack should make a foreign country’s ag- 
gression less likely. This is deterrence. It consists of two 
parts: first, the weapons, and second, the ability to reach 
a decision to use them. 

In arming against Russia, the United States is making 
a move which may be followed by more arming on the 
part of the Russians. This is positive feedback. It should 
be replaced by negative feedback of the kind to be de- 
scribed in the next section. 

Let us return now to the problem, namely, how to ap- 
proach making' the kind of decision the President is 
called upon to make if missiles are detected on their way 
toward the United States. Dr. Karl Deutsch who has 
studied this problem suggests breaking it down into the 
following parts: 7 

1) Broaden the base of facts which lead to a decision. 

2) Improve the reliability of the logic and computation used in 
processing these facts. 

3) Shorten the time for making the decision. 

Let us apply Dr. Deutsch’s analysis to a rough dia- 
gram of the man-machine system now used for making 
emergency decisions. (See Fig. 1.) The upper input il- 
lustrates electronic channels; the lower, written reports. 
The many other inputs have been purposely omitted. 
Data flow from these inputs through a stage of data 
processing before they enter the State and Defense De- 
partments. In the executive departments, the new data 
are correlated with data stored in the files and memories 
of the personnel. They report to the President and they 
may recommend action. The President usually chooses 
between alternatives presented to him. If there is time 
he will consult with the National Security Council be- 
fore deciding. 

7 K. W. Deutsch, private communication; February 21, 1959. 


We can plot on this diagram the three improvements 
recommended by Dr. Deutsch. To broaden the facts on 
which a decision is based, there needs to be a greater 
input of data. In addition, there need to be better ways 
of tapping the facts stored in the executive departments. 
To improve the reliability of logic and computation re- 
quires improved data processors. To shorten the time 
requires an increase in speed of the entire decision- 
making system. 

Pursuit of these three improvements can take us a 
long way toward a solution of our problem. To go further 
requires that we look closely first at the human being 
who holds the office of President, then at the biological 
computer which learns, remembers, and makes deci- 
sions. Delving into these biological mechanisms will al- 
low us to examine possible simulators of memory, ability 
to learn, and ability to make decisions. 

III. History of our Decision-Making System 

We have now described the problem this paper con- 
siders, in language appropriate to the problem. We be- 
gan to convert this description to computer language 
when we made the simplified diagram of the system 
(Fig. 1) and observed that this is a man-machine system. 
To progress further in making a description in computer 
and control terminology, we need to go back to the 
origins of this man-machine system. 

Perhaps by accident, the history of man-machine sys- 
tems has never been told as a whole. To read present 
texts on the subject one might be led to believe that 
man-machine systems are not much more than a hun- 
dred years old. Yet books are a kind of machine. Their 
parts move with respect to one another. Moreover, as 
a human being reads words in a book, he is letting these 
words program the biological computer in his head. 

Thus, a society that lives by rules written in books 
is a man-machine system. It has been evolving for 5000 
years, from the days when men first wrote on stones 
and clay blocks to the present when recorded knowledge 
fills vast libraries. The evolutionary process has been 
carried forward by inventive people who created new 
systems when the need arose for them. 

Benjamin Franklin might be called the first engineer 
to apply himself to the design of the American system. 
We know Franklin for his inventive work in the realms 
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of electricity and heat. He discovered the identity of 
lightning and electricity and advanced the theory, still 
valid, that electricity is of two kinds, “positive” and 
“negative.” He invented the lightning rod, a heating 
system for American homes, and the lending library. In 
1754, he started work on the American system of govern- 
ment. 8 The colonies were then threatened by the French 
and the Indians. The British government called a con- 
gress at Albany in the hope of getting the colonies to 
cooperate in raising troops and funds. Franklin, repre- 
senting Pennsylvania, drafted the plan which the -con- 
gress adopted, although the colonies did not. 

Franklin’s plan, redrafted twenty years later, became 
the Articles of Confederation, which were the system 
specifications for the first American government. When 
a more elaborate system was required, Franklin par- 
ticipated in the writing of the present Constitution. 

James Madison was the leading designer this time. 
Unlike Franklin, he had specialized in the design and 
operation of governmental systems. He had helped to 
set up the state government of Virginia. He had served 
in Congress and observed the weaknesses of the Articles 
of Confederation. When the prospect arose of writing a 
Constitution he wrote out a proposal for it. 

Adopted in 1789, the Constitution has grown since 
then by amendments and interpretation by courts. Con- 
gress has passed laws and administrators have made 
rules to carry out the laws. These rules are the programs 
which public officials pledge that their internal com- 
puters will obey. 

The system devised by Franklin, Madison, and the 
other founding fathers is diagrammed in Figs. 2-4. Lines 
represent information flow. Fig. 2 suggests that each 
Congressman is ideally part of several feedback loops. 
The people in a congressional district elect him, then 
demand action of him. His action may be to participate 
in writing a new law or in opposing a proposed law. One 
feedback loop consists of reports by newspapers, radio, 
and TV. In another loop, the law is carried out by some- 
one appointed by the President. Either the reports 
shown in the first loop or the impact of the law itself 
on wages, prices, and other interests of people shown in 
the second loop, may cause them to change their de- 
mand on their Congressman. If he acts to their satisfac- 
tion, they usually re-elect him. 

The election of the President occurs in another loop 
which takes four years to traverse. Formation of the 
Constitution occurs in still another loop with the longest 
time period of all. Fig. 3 shows the same loops as Fig. 2, 
but now all of Congress and the whole electorate are 
represented. The whole body of law enacted by Congress 
is shown as a block at the center. To it is attached a 
small block below it, representing the newly enacted 
law. 

8 H. C. Hockett, “Political and Social Growth of the United 
States, 1492-1852,” The Macmillan Co., New York, N. Y., pp. 188, 

189, 247, 286; 1935. 


A Congressman is also part of feedback loops that 
include very much larger groups of people than a con- 
gressional district. Such groups might be the automobile 
industry, the United States, or mankind. To show these 
feedback loops would require a very much more intricate 
drawing than Fig. 3. The number of these additional 
feedback loops and the quantity of people that they in- 
volve are a measure of the breadth of interests and the 
statesmanship of a Congressman. 

Fig. 4 shows the response that the system was de- 
signed to make to an offensive incident or series of inci- 
dents by another nation. The incidents bore on the 
electorate or on special interest groups among the 
electorate who demanded action from Congress and the 
President. When a “threshold of tolerance” was crossed, 
Congress declared war and the President carried out the 
war through his secretaries of War and Navy. In prac- 
tice, the incidents may have affected the owners and 
editors of mass media of communication so that they 
demanded action from Congress and the President. Or 
the incidents might come more fully to the attention 
of the executive than the public and thus the threshold 
of tolerance of the President would be crossed before 
that of the public and he would press Congress to a 
greater degree. This happened from 1939 to 1941. 

Germany under Hitler, Russia then and today, lack 
the free flow of information and feedback controls of the 
kind described above. They are less stable in their rela- 
tions with other nations. For example, a treaty, being a 
law, is part of the feedback control system of the United 
States. A treaty made by a dictatorship is observed or 
not as the dictator sees fit. 

A system like that devised for the United States could 
be devised for the entire world and provide stability in 
that area also. The man part of the system needs to be 
educated for its task. The machine part needs to be 
capable of greater speed and reliability than the original 
system designed for the United States. 

IV. Changes to the Decision-Making System, 
1950 to 1959 

We have described in computer and control terminol- 
ogy the system that operated to repel an attack up to 
1950. Let us now look at the changes that have been 
made in the present decade. Steps have been taken in 
each of the three directions that we considered desirable 
in Section II. 

Fig. 1 showed the pattern of response that has been 
taking shape since 1950. Congress is no longer part of 
the loop of response. In January, 1955, Congress 

handed to the President the power to defend Quemoy and Matsu if he 
likes, and to) use atomic weapons there at his discretion. . . . The 
pattern is now clear; in the Middle East, as in the Far East, Congress 
has left it to the President to fight or retreat as he sees fit. 9 


9 J. Reston, “War-making power; Quemoy crisis shows how con- 
trol passed from Congress to President,” New York Times , vol. 107, 
p. 4; September 4, 1958. 
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Fig. 2 — Congressman in feedback loops. 



This act of Congress formalized the practice begun by 
President Truman at the outbreak of the Korean Con- 
flict in 1950. This practice has served to shorten the 
time for making the decision, but I question if it has in- 
creased the reliability of the decision. The older system 
requiring debate in Congress and across the nation 
brought more minds to bear on the problem. 

The flow of facts into the decision-making system has 
been increased and speeded by two unique electronic sys- 
tems, SAGE (Semi-Automatic Ground Environment) 
and BMEWS (Ballistic Missile Early Warning Sys- 
tem). 

Fig. 5 shows the contents of the upper half of the 
diagram of Fig. 1 arranged in pictorial fashion. For 
simplicity, it shows only the part of the system where 
data are detected and moved at electronic speeds. The 
flow of reports from overseas to the State Department 
is assumed to be present but not shown. SAGE com- 
puters with radars above them are shown in the inner 
ring. BMEWS computers with radars above them are 


Fig. 4 — Response of the United States to an attack, 1789-1950. 



Fig. 5 — Hypothetical response of the United States to an attack in 
the present and near future. (The flow of reports from overseas to 
the State Department is assumed but not shown.) 
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shown in the outer ring. Their signals are shown entering 
a hypothetical central computer which organizes them 
for presentation to personnel in the Defense Depart- 
ment. These people merge the new data with pertinent 
data from their own memories or their files. Then they 
make selections from these merged data to compose re- 
ports and make recommendations to the President. 

The SAGE and BMEWS systems are part of the im- 
provement to the decision-making process that we seek. 
They broaden the base of facts on which a decision 
would be made. Let us examine those systems in more 
detail. 

Sage is a gigantic man-machine system whose radars 
watch the sky over the United States and feed informa- 
tion into the largest computers so far mass-produced. 10-15 
At each of about 30 “direction centers” in the United 
States, a 75,000-instruction program runs continuously 
to process the data and display them on large scopes. 
Fig. 6 shows a typical computer center. Fig. 7 shows 
this center being fed by information from radars at the 
left and giving out information to planes and missiles at 
the right. A tie is shown at the top to higher head- 
quarters and at the bottom to an adjacent direction 
center. 

Fig. 8 shows the point at which the SAGE computer 
gives up its data to a man who then makes a decision. 
Here an Air Force officer, looking at a displayed map 
on which approaching enemy planes are shown, orders 
planes or missiles to intercept them. The SAGE com- 
puter carries out his order by directing the plane or mis- 
sile to the target. 

In 1960 a system is scheduled to go into operation 
which will inspect in a similar fashion the air space be- 
tween the United States and other countries. This is 
the Ballistic Missile Early Warning System (BMEWS) 
whose radars have a range of 3000 miles. 16 The radar re- 
turns will be interpreted by a computer to discover 
whether each object seen moving at high speed is a 
meteor, a satellite, or an ICBM. As in the SAGE system, 
the conclusions reached could be used to generate a dis- 
play, send a message, or fire an interceptor missile. 
But it can also do more than SAGE. By tracing the 
trajectory of a missile, BMEWS can determine where it 

10 R. R. Everett, C. A. Zraket, and H. D. Bennington, “SAGE, a 
data processing system for air defense,” Proc. EJCC, pp. 148-155; 
December, 1957. 

11 W. A. Ogletree, H. W. Taylor, E. W. Veitch, and J. Wylon, 
“AN/FST-2 processing for SAGE,” Proc. EJCC, pp. 156-160; De- 
cember, 1957. 

12 R. R. Vance, L. G. Dooley, and C. W. Diss, “Operation of the 
SAGE duplex computers, ” Proc. EJCC, pp. 160-163; December, 1957. 

13 M. M. Astrahan, B. Housman, J. F. Jacobs, R. P. Mayer, and 
W. H. Thomas, “The logical design of the digital computer for the 
SAGE system,” IBM J. Res. Dev., vol. 1; January, 1957. 

14 H. D. Bennington, “Production of large computer programs,” 
Proc. Symp. on Adv. Prog. Methods for Digital Computers, ONR 
Symp. Rep. ACR-15; June 2, 1956. 

15 D. R. Israel, “Simulation in large digital control systems,” 
presented at the Natl. Simulation Conf., Houston, Texas; April, 
1956. 

16 “The ICBM’s: danger — and deterrents,” Newsweek, vol. 52, pp. 
56-57; December 22, 1958. 



Fig. 6 — A SAGE direction center building. (Photograph by 
Lincoln Laboratory, M.I.T.) 



Fig. 7 — Inputs to SAGE direction center are from radars at left, 
weather stations and commercial planes below. Outputs are to 
planes, missiles, adjacent direction centers, and higher head- 
quarters. (Drawing by Lincoln Laboratory, M.I.T.) 



Fig. 8 — Air Force officer ordering interception of enemy plane or 
missile. (Photograph by Lincoln Laboratory, M.I.T.) 


came from; then, assuming the missile takes no evasive 
action at a later stage, BMEWS can predict where the 
missile is likely to go. The prediction may make it 
possible to destroy the missile in the air. The estimate of 
where the missile came from can be the basis for a deci- 
sion to retaliate. 
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Congress’ response to the threat of nuclear attack has 
been to increase the effectiveness of the President and 
at the same time weaken the feedback loops of which 
the President is a part. This has reduced the sensitivity 
of the control system to public demands and restraints. 
It appears that attention should be given to providing 
new control loops to replace those that have been 
weakened or removed. But first let us give our full atten- 
tion to the problems of the President. 

V. Problems of the President in an Emergency 

The following are two situations that he might have 
to face. 

Fig. 9 shows country A (aggressor) launching an at- 
tack on country N (nonaggressor) intended to both 
destroy it and prevent it from retaliating. Let us assume 
that the deterrent power of country N is its ability to 
launch missiles. It appears that in the immediate future, 
the majority of launching sites are likely to be known, 
with the result that a retaliatory attack by missiles can 
be made only if it is started before the original attack 
arrives. There are several “ifs” here and if they are all 
to be satisfied, speed of decision is very important. How- 
ever, when the retaliatory power is hidden, as we are led 
to believe it will be in a few years, great speed will not 
necessarily be needed. A reliable decision requiring days 
if necessary appears far more important, lest an error 
be made. 

However, a circumstance that would demand speed of 
decision arises when the President’s life is threatened by 
an approaching missile. He has two alternatives: to or- 
der a retaliatory attack on a suspected country, or to 
wait, knowing that if he is destroyed someone else may 
order the retaliatory attack. Needed is fast processing of 
data that give him a reliable basis for decision in the 
time he has available. 

As we approach closer to an examination of the duties 
of the President, let us consider what Dr. Deutsch be- 
lieves a data-processing system can and cannot do 
today: 7 

1) Compute trade-offs (if I do this, then what?). 

a) What might be the effect of each of our actions on the 
civilians in this country? 

b) What will be the effect of each of our actions on the capabil- 
ities of the attacking countries? 

c) What will be the effect on third countries? 

2) Prepare estimates of the over-all effect of an action. 

3) Make recommendations to the President. 

No computer today has the learning capacity of an individual, 
much less that of a community. Computers should facilitate human 
and community learning by evaluating and cross-checking relevant 
data. Progress consists of putting more and more of the information- 
handling burden on the mechanical and electronic equipment and 
leaving an ever-smaller amount of ever-higher decisions to the human 
agent. 7 

But suppose the human agent does not respond be- 
cause he is asleep, as Dr. Rabi suggested, or for some 
other reason. It is the obligation of computer engineers 
and programmers to inquire what they can do to sup- 
plement the President. The American people may not 



Fig. 9 — Country A launching an attack on countrv N. 


accept what they propose. But proposals should be 
made periodically and in greater detail as more tech- 
niques become available. 

To that end a description will be made first of the 
emergency duties of the President, then of the qualities 
that led to his selection by the American people. These 
two descriptions can be regarded as part of the specifica- 
tions of a simulator. With these specifications before us 
we will then inquire how far engineers have progressed 
toward the emergency simulation of the duties of the 
President. 

VI. The Duties and the Qualities of a President 

The President’s task in the problem we are consider- 
ing is to order or not to order the military to act. He is 
there to make sure that the military are effectors, not 
decision points. For example, in an international crisis, 
military men get poised, ready to use their weapons. 
The President, on the other hand, will act the way his 
personality dictates. 

All that we ask of a President is that he be his best 
self. We mean by this that we ask him to apply to a 
major decision the traits that he demonstrated before 
taking office. Yet all of us have our ups and downs. 
There is always the possibility that a quick decision 
will be required when the President is not at his best. A 
system to back up the President, therefore, is being con- 
sidered. 

If such a system were to win the acceptance of the 
American people it would need some of the qualities of a 
President. What are some of these? 

To avoid the mental images of actual Presidents, let 
us refer to the President for the moment as a system — a 
very elaborate biological system. This system is put into 
its key position by a process whose first milestone is 
nomination at a national convention. It is then tested 
for three to four months in a kind of trial presidency dur- 
ing which it is presented with the problems of the Presi- 
dent and called upon to declare what decisions it would 



320 


1959 PROCEEDINGS OF THE WESTERN JOINT COMPUTER CONFERENCE 


make if it were the President. During this same time, the 
system is watched by reporters and TV: How does it 
treat its wife, its children, its friends? What are its be- 
liefs? Does it get angry easily? During this testing pe- 
riod, an image is built up in the minds of the voters. 
The image is one of a predictable system, to the extent 
that the voter has made observations. On election day, 
at the end of this test period, voters choose between two 
or more systems. 

Looking more closely at a system, we observe that 
what interests the voters most — or what we think should 
interest them most — is its information-processing sub- 
system. This is a network of switching and storage ele- 
ments. Of the 30 million million cells that comprise a 
human system about one tenth make up its information- 
processing or nervous subsystem. 

Dr. McCulloch calls this subsystem a “biological 
computer.” 17 Feeding information into it are the senses 
of sight, hearing, touch, taste, smell, and acceleration. 
It contains three kinds of memory, a means of learning, 
and a means of making decisions. It appears that a sys- 
tem to simulate (see the Appendix) the duties of the 
President will require the following properties of biologi- 
cal computers: 

1) Memory 

2) Ability to learn 

3) Ability to make decisions. 

In the following three sections we briefly describe 
and evaluate the steps that the computer engineering 
profession has taken toward simulation of the duties of 
the President. These efforts are for other purposes, but 
they serve this purpose. 

VII. Simulation of Human Memory 

By computer mepiory we mean both the static storage 
and the continuously running program that up-dates 
this storage and presents alternatives for decision. Let 
us look at the memories in both SAGE and in Industrial 
Dynamics Research programs at M.I.T. 

From the data received by its radars, a SAGE com- 
puter can predict the course of each aircraft in the air- 
space which it is monitoring. It can predict the points 
at which interception can be made by aircraft taking 
off from different airfields. An Air Force officer, watch- 
ing the two predictions plotted on a scope, can select an 
aircraft to make an interception. This action is illus- 
trated in Fig. 8. 

Just as the SAGE computer contains a model of mov- 
ing aircraft, so an Industrial Dynamics program con- 
tains the model of a company. In a diagram of a typical 

17 W.S. McCulloch, “Reliability of Biological Computers,” lecture, 
University of Pittsburgh, Pittsburgh, Pa.; May 10, 1957. (Unpub- 
lished.) 


model, 18 a solid line represents the flow of goods from 
the factory to the warehouse, to the retailer, and finally 
to the customer. Dashed lines represent the flow of in- 
formation from the customer to the retailer and all the 
way back to the factory. Numbers in the lines indicate 
the length of delays. Where a flow of goods and a flow 
of information touch, a decision is made. 

Forrester’s diagram represents a more advanced form 
of analysis than that shown in Figs. 3-5. The analysis 
itself consists of difference equations. 

The following (typical) equation tells how to calculate the level 
of Unfilled Orders at (the) Retail (end of the business) at time K: 

UOR k = UORj + DT(RRRj K - SSRjk). 

This equation tells us that the unfilled orders at retail at time, K, 
are equal to the unfilled orders at retail at the previous time, J, plus 
the inflow minus the overflow. 18 

The inflow is the product of a time interval DT and a 
rate, RRR, that holds from times J to K. The outflow 
is the product of the same time interval and another 
rate, SSR. Each equation is evaluated independently, 
using the results from the previous evaluation of all the 
equations. (See the Appendix.) 

While the simulator of the President would require 
facts and figures bearing on current issues, its memory 
of environment can be approximate. Industrial dy- 
namics models could serve this purpose. The model de- 
scribed above was intended to bring understanding of 
one company to its factory manager or corporation ex- 
ecutive. Models of the groups of companies that make 
up an industry would be useful to the simulator of a 
President. Models of the United States government, its 
allies, and its adversaries would be necessary. 

VIII. Ability to Learn 

The present system for making emergency decisions 
is one that learns. The biological computers in the sys- 
tem learn by changing, or increasing, the storage in 
their memories. The system as a whole learns in several 
ways, one of which is illustrated in Fig. 3. Here trials 
and errors are recorded in the memories of human 
beings and lead to new rules. 

The first method of learning we shall consider for the 
simulator is continual reprogramming. Dr. Richard C. 
Clippinger suggests : 19 

It will probably be necessary for the governmental simulator to 
operate in parallel with the President for a considerable time in 
order to learn. Computer learning is similar to the successive repro- 
gramming of a complicated process by means of more and more 
efficient programs, drawing intelligently on more and more past 
experience. Probably the longer it has been in operation the more 
efficient it will be, that is, the more it can accomplish in a few micro- 
seconds. 


18 J. W. Forrester, “Formulating Quantitative Models of Dynamic 
Behavior of Industrial and Economic Systems, Part I,” Industrial 
Dynamics Res., School of Industrial Management, M.I.T., Cam- 
bridge, Mass., Memo. D-16, pp. 8, 30, 31; April 5, 1958. 

19 Private communication, October 19, 1958. 
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The SAGE system learns in the manner described by 
Dr. Clippinger. A staff of programmers at the System 
Development Corporation in Santa Monica, Calif., at- 
tends the system and incorporates what is learned in 
an improved program. 14 To “get back into” a program 
of 75,000 instructions requires careful documentation 
augmented by computer methods for changing the 
>, program. The need to rework increasingly large pro- 
grams is an incentive for the second method of computer 
learning we are considering here — heuristic program- 
ming or “artificial intelligence.” 

Dr. John McCarthy describes artificial intelligence: 20 

These programs all use trial-and-error learning. A criterion for 
an acceptable solution is known. Then the machine “searches” a 
group of potential solutions for one answer that meets the criteri- 
on ... . Unfortunately the groups or classes of potential solutions of 
interesting problems are too large to be examined one at a time by any 
conceivable computer. 

Therefore, we must devise methods called heuristics for replacing 
the search of the class of potential solutions by a number of searches 
of much smaller groups. It is in these heuristics that the intelligence, 
if any, lies. 

Programs written by Newell, Shaw, and Simon have 
proved theorems of logic 21 and played chess, each with 
increasing skills A program written by Gelertner and 
Rochester containing the theorems and heuristics 
taught in a high-school geometry class has done the 
homework and taken the examinations of that class. 22 

But each of these programs handles only a limited 
range of problems. To extend the range we need to tie 
together a learning system with many storing systems. 
Each of us needs only to look in a mirror to see a system 
that does all these things and, in addition, makes de- 
cisions of the kind described in the next section. Exami- 
nation of this system is instructive. Its elaborate trans- 
ducers facilitate learning. These transducers include the 
eyes, ears, sense of touch, and inertia-sensitive inner 
ears. For each transducer there is a corresponding part 
of the biological computer where information is proc- 
essed before it is stored. Thus the transducers are not 
only detectors, they are filters, switching incoming in- 
formation toward its place of storage. Furthermore, 
they are adjustable filters. When you are looking for 
something, you have tuned your detectors to find that 
thing and ignore other things. Searching for a red ribbon 
in your bureau drawer, you tune your eyes to search for 
red and need only make a yes or no decision about each 
thing you see. 

The radars of the SAGE system report only targets 
moving at a speed greater than a certain amount. How- 
ever, the filter here is not adjusted by the computer. 

20 J. McCarthy, “Getting closer to machines that think,” New 
York Herald-Tribune, Engineering News Supplement; May 24, 1959. 

21 A. Newell, J. C. Shaw, and H. A. Simon, “Empirical explora- 
tions of the logic theory machine, a case study in heuristic,” Proc., 
WJCC, pp. 218-230; February, 1957. 

22 H. L. Gelertner and N. Rochester, “Intelligent behavior in 
problem-solving machines,” IBM J. Res. Dev., vol. 2, pp. 336-345; 
October, 1958. 


Moreover, radars “see” with very coarse resolution. 
Great sums of money have gone into the development 
of radar. There has yet to be a comparable effort at 
developing a high-resolution system with adjustable 
filtering to enable an electronic system to “see” the ob- 
jects that human beings not only see but think about 
most of their waking hours. 

In the absence of its own inputs, the simulator will 
have to take in the form of punched cards or electric 
signals the observations of those who do have these in- 
puts. Lacking a filtering system, it will have to use the 
classifications of events made by these observers. The 
classification can determine what heuristics and what 
part of the memory are to be employed. 

IX. Ability To Make Decisions 

Decisions can be made by computer programs ac- 
cording to predetermined rules. To run these rules and 
memory of the kind developed by Industrial Research 
and, possibly, a learning routine would make a slow 
simulator. Speed can be obtained by imitating the hu- 
man decision system. 

The decision-making apparatus in the human system 
is the reticular formation. It is the core of the brain 
stem. It is about as big around as a cigarette and about 
two inches long. Each of the several thousand large 
cells in this formation: 

receives signals from almost every source in the human body, coded 
in pulse-interval modulation to convey whence the signal came from 
and what happened there .... The reticular formation decides what 
he ought to do, what he should heed, how vigilant he ought to be and 
whether he has time for that idle fancy that inspires his future 
action. 23 

The method by which the several thousand large cells 
of this formation reach a decision is similar to that used 
by a battle fleet. 

Every ship of any size or consequence receives information from 
the others and sweeps the sky for hundreds of miles and the water 
for tens of miles with its own sense organs. In war games and in 
action, the actual control passes from minute to minute from ship 
to ship, according to which knot of communication has then the 
crucial information to commit the fleet to action .... It is a re- 
dundancy of potential command, wherein knowledge constitutes 
authority. 

In the reticular formation, each cell is like a ship of 
this battle fleet, able to take command when the infor- 
mation it has received is accepted, by all of the several 
thousand large cells, as that most requiring attention. 

Having spent much of his life mapping the nervous 
systems of monkeys and men, Dr. McCulloch is now 
studying the nerve connections of the human reticular 
formation. Every one of the several thousand large cells 
in this formation is connected to nearly every other. In 
addition, every one of these cells receives signals from 

1 

23 W. S. McCulloch, “Where is fancy bred,” Bi-Centennial Conf. 
on Experimental Psychiatry sponsored by the Western Psychiatric 
Institute and Clinic, Dept, of Psychiatry, University of Pittsburgh 
School of Medicine, Pittsburgh, Pa.; March 5, 1959. 
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some of the afferent cells of the body and from some of 
the cells of the cerebral cortex. This much can be deter- 
mined from dissection. What cannot be determined this 
way is how each cell influences every other. 

Fortunately, much is known about how the reticular 
formation performs. From this knowledge, McCulloch 
is considering a possible logical diagram showing how 
its neurons may affect each other. The resulting design 
can be implemented by artificial neurons such as those 
being built by Jerome Lettvin. 24 

Could the logical design also be implemented by a 
programmed computer? A small part of it could. Each 
neuron can be represented by storage registers contain- 
ing the neuron threshold, the state of the neuron after 
the last cycle of excitation and inhibition, and the na- 
ture of the connections to other neurons. To simulate all 
of the interconnections in the clock time of the brain 
would require the processing of at least 1000(!) instruc- 
tions in 0.1 second. 

An assembly of artificial neurons is called a parallel 
computer, meaning that all logical operations are oc- 
curring at the same time instead of sequentially, as in a 
programmed computer. For the present, parallel logic is 
a goal to work towards while using programmed logic. 

X. When Should the Simulator Be Used? 

A programmed simulator, although slow, can render 
a service now by providing an operating model of the 
environment of the President, by demonstrating how 
new rules may be learned, and by demonstrating how 
rules may be applied to make decisions. Starting as a 
guide to decision-makers, a simulator could be gradually 
improved until it might be able to make decisions on its 
own. It would be for Congress, the President, and the 
American people to decide if the simulator should be 
allowed to do this. 

Three measures will be suggested as aids in deciding 
when a simulator should be used in this way. One meas- 
ure is the extent of internal restraint. As Dr. Deutsch 
puts it: 

For any large . . . memory system, the specific content of all com- 
binations that might become dominant . . . cannot be predicted. The 
possibilities are too numerous as to what combinations might arise 
in a human mind, or in any computer . . . remotely comparable. 
Hence we fear entrusting political control to any one human mind, 
or to any small committee, even though we trust them as being 
human personalities . . . who share the unspoken and unstated val- 
ues and inhibitions of our culture and religion. 

An electronic machine (at present) can include in its memory, 
at best, only those rules of law, morality and religion that have 
been stated explicitly in words. . . . These . . . rules a computer 
would then apply with terrible literal-mindedness. It might become 
the electronic embodiment of the letter that kills, rather than of the 
spirit that gives life. 

24 J. Y. Lettvin, “Nerve Models,” Res. Lab. of Electronics, 
M.I.T., Cambridge, Mass., pp. 178-179; January 15, 1959. In the 
diagram, the unlabelled diode at the left is the excitatory input; that 
at the right, the inhibitory input. The wiper of the potentiometer 
determines the threshold. 


Limitations of computers, when recognized by engi- 
neers, appear to stimulate efforts to overcome the 
limitations. This gives direction to the development of 
new techniques of memory, ability to learn, ability to 
make decisions and the additional categories mentioned 
by Dr. Deutsch. 

A further challenge from him should be quoted: 

To build into a computer the properties of perceptiveness, tol- 
erance of ambiguity, mercy and spirituality — that is, perceptiveness 
toward second-order and higher-order patterns of preferences — 
would require capabilities far in excess of those available at present. 
So long as such vastly greater capabilities have not been developed, 
computers can aid human judgment but cannot safely replace it. 

The second measure we shall consider is the extent 
and sensitivity of feedback control such as that in Fig. 
3. If we find difficulty in trusting one human mind, we 
shall have greater difficulty in trusting a simulator. 
However, a control network is possible consisting of 
many simulators. Given authority to act, a decision 
would be made by a majority of those simulators that 
had not been destroyed by attack or sabotage. Each 
would simulate the duties of a Congressman or group of 
Congressmen. As Dr. Clippinger has suggested for a 
simulator of the President, 19 each should be operated in 
parallel with the one it is simulating so as to: 

... (a) learn, (b) demonstrate to Congress and the President that it 
is worthy of their respect and faith for at least a limited period, (c) 
provide time to educate and persuade the people of this democratic 
country that it should be used. 

Such a network could have feedback controls as exten- 
sive as Congress itself, at least during trial periods. 

The third measure of when a simulator should be used 
is the measure of the emergency when, if Congress, the 
President, and the American people have previously 
approved, the simulator would be permitted to act. 
Seeking this measure takes us back to the question 
raised by Dr. Rabi. In accord with that, two conditions 
would make the use of a simulator desirable. One con- 
dition is imminence of destruction such as a 90 per cent 
probability that 5,000,000 people will be killed, a 9 per 
cent probability that 50,000,000 people will be killed, or 
any of the equivalent probabilities. The other condition 
is the inability of the President to respond. 

Equipment with extraordinary reliability is needed 
to determine both of these conditions. The estimate of 
probable deaths would need to be made by a computer 
that has both information about approaching missiles 
and models of population. The President’s ability to 
respond in a predetermined time could be determined 
by interrogating him, by requiring him to report periodi- 
cally, or by some other method. 

The desired reliability should be obtained either by 
operating computers in parallel, which is done in the 
SAGE system, or by applying the theory of building 
reliable circuits out of unreliable components. 3 The 
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latter requires the kind of parallel logic described in the 
last section with interconnections and thresholds so 
selected that the failure or erratic behavior of one or 
more elements will not affect the output. 

Appendix 

Definition of Simulation 

The word “simulation” is used in this paper in its 
modern technical sense: 25 

... to assume the appearance of, . . . without any intention to de- 
ceive. I refer to its use in the field of mechanical-electronic com- 
putation. Here the procedure is to simulate physical or mental 
processes in setting up a problem which is then given to a computer to 
solve. 25 

The Industrial Dynamics Research program at 
M.I.T. uses the words “make a model of” in the place of 

25 J. C. Warner, “The fine art of simulation,” Carnegie Alumnus, 
Carnegie Inst, of Tech., Pittsburgh, Pa.; 1959. 


“simulate.” The model in this case is a set of equations. 
These M.I.T. people save the word simulate to describe 
the evaluation of these equations, one at a time, for a 
given set of input conditions. They solve the equations 
at time intervals which are short compared to the 
shortest delay intervals of the system being modeled. 
They are thus simulating simultaneous solution. 

In this paper “simulate” is given the meaning of the 
first paragraph above. Simulation here is intended to 
achieve a “quality” equal to or excelling the perform- 
ance of the human being to be simulated, for the periods 
when it is given his responsibility. The “quality” of per- 
formance is a composite of breadth of facts which lead 
to a decision, reliability of the logic and computation 
used in processing these facts, speed, and human con- 
siderations. A simulator might attain acceptable qual- 
ity by excelling in some of these considerations while 
falling short in others. 


Can Computers Help Solve Society’s Problems? 

JEROME ROTHSTEIN f 


Introduction 

T HE advent of large-scale computers gave new im- 
petus to mechanizing the handling of tremendous 
quantities of data. It also indicated the possibility 
of carrying out many ventures of social significance 
which are now completely impractical. It is hard to see 
an important social revolution in the first, per se. Auto- 
matic billing of telephone subscribers or mechanization 
of clerical activities, for example, is only substitution of 
machine for manual activities. This has been going on 
continuously since the beginning of the industrial revo- 
lution. Exciting prospects emerge when one considers 
fields characterized by enormous amounts of data to- 
gether with complicated intertwining causal relation- 
ships buried under statistical blur. 

The present paper considers a few of very many pos- 
sibilities. They were chosen mainly because one might 
expect them some day to have enormous impact, both 
on the individual and on society. The first group bears 
on the weather and on economic planning and policy, 
the second on various questions of public health, and 
the third on “the proper study of mankind,” man him- 

f Edgerton, Germeshausen and Grier, Inc., Boston 15, Mass. 


self. They are tentative groupings with no pretense to 
completeness or profundity. It is believed, however, 
that they make some general statements plausible. 
These are that modern computer and data handling 
techniques may 

1) lead to making our economic system more produc- 
tive, and to smoothing cycles of inflation and deflation, 
or employment, and of farm income; 

2) revolutionize our ideas of public health, and make 
the world a more wholesome dwelling place ; 

3) revolutionize our knowledge of ourselves, our 
abilities, susceptibilities, mental, physical and genetic 
constitution, as well as diagnostic and preventive 
medicine. 

We believe it is the responsibility of the computer 
engineer and scientist to point out such potentialities, to 
acquaint specialists in many fields with what computers 
can do, to collaborate with them in applying computer 
techniques to those fields, to keep research foundations 
and government agencies aware of areas worthy of sup- 
port, to keep administrators, policy makers, and legis- 
lators informed and advised, and thereby to assist in 
the formulation of sound public policies. 

In the discussion below, military, industrial, and sci- 
entific applications are very largely neglected. This is 
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not because these fields are unimportant, but rather 
because their importance has been so well recognized 
that a paper of this length could now contribute little 
to them. 

Weather, Computers, and Economic Policy 

The historic role of weather and climate in human 
and economic terms needs no belaboring. Famine, 
drought, floods, plenty, epidemics, mass migrations, the 
rise and fall of civilizations, wars and their outcome 
have often been engendered or decided by long- and 
short-range weather and climatic variations. With fore- 
knowledge, as in the biblical story of Joseph and 
Pharaoh, countermeasures become possible, with tre- 
mendous gain in economic and human terms. Modern 
data processing is making it possible to use the tremen- 
dous mass of accumulated and current meteorological, 
astronomical, and climatic data to make long- and 
short-range weather predictions more accurate. When 
continuous global cloud surveys by satellite are avail- 
able, accuracy will advance even more, and even 
weather control will doubtless be more effective. The 
average citizen will know when to carry an umbrella or 
plan a vacation, retail establishments will plan inven- 
tories more intelligently, and farming will be less of a 
gamble. If one knows a drought is going to occur, for 
example, it is silly to plant extensively where there is no 
artificial irrigation. If it is known that a good crop is 
imminent and that the following year will be one of 
drought, the government might well consider making 
provisions for crop storage. Knowledge of this sort on a 
global scale can go a long way toward eliminating fam- 
ine in one place and glut in another. The effect of this on 
international relations and on the likelihood of revolu- 
tions and wars may well be stupendous. More locally, 
public works programs in agricultural areas could be 
planned to dovetail with an expected decline in demand 
for farm labor, with lowered shock or recession of the 
whole economy. Production based on agricultural raw 
materials can be more intelligently programmed when 
yields are forecast even approximately. Inventory pol- 
icy and servicing depots for farm machinery could be 
set up more efficiently. Railroad rolling stock, truck 
fleets, and the like can be administered more efficiently 
when demand is predicted more accurately. Clearly, the 
techniques of operational research, which could make 
extensive use of data-handling systems, would become 
much more readily applicable to tremendous areas of 
planning, allocation, production, servicing, inventory, 
and other policy problems. Expert consulting services 
for small enterprises, unable to support operational re- 
search on their own, would become economically justi- 
fied, and might well lower failure rates for small busi- 
ness. Fuel, electric power generation, and hydroelectric 
policies are also weather dependent, and the economic 
value of long-range weather information in these fields is 
clearly tremendous. It hardly seems an exaggeration to 
say that little of the national and world economy is un- 


affected by weather, and that most of it would benefit 
by better knowledge of weather trends. 

There is no reason to restrict considerations like these 
to agriculture. Some control over inflation and deflation 
is exerted by the Federal Reserve System and the Treas- 
ury by control of discount rates and the sale of govern- 
ment securities, for example. Policies presently estab- 
lished as “curative,” often with undesirable lag, may 
become “preventive” when economics, aided by large- 
scale computer techniques, has made economic fore- 
casts more accurate. The forecast problem is comparable 
in complexity to the global weather problem, and may 
well be attacked with similar techniques. Fluctuations 
may be one of the prices of freedom, but the instability 
and suffering engendered by excessive swings can sure- 
ly be much reduced by intelligent countermeasures, and 
with no real loss of freedom. It would seem almost sui- 
cidal not to develop such countermeasures. Computer 
techniques can be expected to play a vital role in this 
field. 

Computers and Public Health 

In many fields of public health, detailed cause-and- 
effect relationships are submerged by multitudinous 
accidental factors. Masses of data gathered over long 
periods of time must often be digested and tested by 
sophisticated statistical techniques before valid state- 
ments can be made about them. All statements, on 
which action is to be taken, should be treated as test- 
able statistical hypotheses, with action justified when 
the hypothesis has reached some preassigned confidence 
level. The examples below were chosen at random, and 
represent but a small sample of the total one could cite. 

Fluoridation of drinking water is a measure backed 
by much competent medical opinion and bitterly fought 
by a number of lay groups. Proponents of fluoridation 
claim that tremendous reductions can be made in 
dental caries with no ill effects. Its opponents make dire 
predictions about the effect of fluoride on the kidneys, 
the nervous system and almost every other organ in the 
body. There are millions of people at the present time 
who consume fluoridated drinking water. It therefore 
seems entirely feasible to amass completely convincing 
and compelling data on the correlation between fluori- 
dation of drinking water and every ill to which the flesh 
is heir. Of course, even in the face of compelling evidence 
one often finds crackpots who refuse to be convinced. If 
one has faith in democracy, however, one must believe 
that in the long run the majority will recognize and ac- 
cept the truth. If statistically valid evidence of this sort 
is gathered — and computers can certainly play a vital 
role in doing this — and it turns out that almost all tooth 
decay can be safely and permanently eliminated from 
the population, the gain would be incalculable. 

A similar situation appears to be involved in estimat- 
ing the effect of radioactive background and cosmic 
rays on stillbirths, cancer, and genetic impairment of 
large populations. It has been supposed by some that 



Rothstein: Can Computers Help Solve Society's Problems? 


325 


there is a certain threshold of danger. Others maintain 
that any increment in the radiation background, no 
matter how small, will produce additional cases of de- 
fective births, mutations, and of cancer. One clearly 
cannot make controlled human experiments, but the 
city of Denver is a mile higher than New York. The in- 
habitants of Denver are thus exposed to a higher cosmic- 
ray background than the people of New York. In addi- 
tion, uranium ores in Colorado and a number of other 
western states must subject the people in those regions 
to radioactive background, to uranium materials in 
their foods and the like which are absent in many coastal 
areas. Some parts of the world, such as Travancore, 
India, have high natural backgrounds due to the pres- 
ence of monazite sands or other radioactive materials. 
It thus seems entirely feasible toget statistically convinc- 
ing data on the incidence of conditions of all sorts in ex- 
isting populations living under a variety of radiation 
backgrounds. With extensive, computer-processed data 
one can make a more realistic assessment of the dangers 
of atomic testing, for example. One can perhaps find 
whether a threshold for radiation damage exists, or 
even if there are beneficial effects of radiation in very 
small amounts. There is some evidence that normal 
individuals have some immunity to cancer. It therefore 
does not seem entirely impossible that fall-out in very 
small amounts might have no effect on normal indi- 
viduals but could shorten the life expectancy of indi- 
viduals fated to succumb to cancer by a small amount. 
Similarly, data on abnormal births and congenital de- 
fects might lead to a better understanding of the genetic 
hazards. 

A third field is that of poisons, taken in a broad sense. 
Coal or oil smoke from the heating plants of private 
dwellings, and exhaust fumes from automobiles could 
conceivably have subclinical toxic effects. It is known 
that carcinogenic substances are produced in oil refin- 
eries. Such substances can also be produced by combus- 
tion of tobacco and many organic or carbonaceous ma- 
terials. Some studies indicate a connection between 
smoking and the incidence of lung cancer and cardio- 
vascular diseases. It seems possible that “chemical fall- 
out” from daily industrial, heating, smoking, and auto- 
mobile-riding activities could be a hazard greater than 
radioactive fall-out. Chemical agents can also produce 
mutations. Among these are colchicine and other com- 
plex compounds related to coal tar derivatives and other 
substances with great biological activity. A large-scale 
statistical survey of the quantities and identities of at- 
mospheric and other contaminants of all sorts, and their 
correlations with the incidence of various diseases, ap- 
pears eminently desirable. If one found, for example, 
that small communities in the Rocky Mountains have 
far lower incidence of cardiovascular disease than smog- 
gy industrial areas, with suburban residential areas in- 
termediate, then laws requiring the chemical clearance 
of smog and precipitation of dust particles might ulti- 
mately be in order. Many industrial poisons are known 


and in many places measures have been taken to pre- 
vent atmospheric contamination, but by and large a 
serious and obvious outbreak of some condition seems 
to be required before public opinion is aroused enough 
to take action. It is therefore quite possible that many 
cases of this sort are unnoticed because cause-and- 
effect relations are buried in a sea of accidental factors. 
The possibility that we can live under healthier condi- 
tions seems too real for us to overlook these potential 
applications of computer processing and analysis of the 
tremendous amounts of data required. 

Computers, The Individual, and Society 

Every human being is a unique complex universe. As 
life insurance companies well know, this does not pre- 
vent the drawing of valid statistical inferences about 
human populations. The more homogeneous a statisti- 
cal population, the more accurately can inferences be 
made about unobserved individuals on the basis of ob- 
servations on a sample. In between the heterogeneity of 
homo sapiens as a whole, and the homogeneity of identi- 
cal twins, one senses the possibility of a classification 
scheme which would divide mankind into groups of suf- 
ficient homogeneity to make it possible to draw med- 
ically or otherwise useful inferences about a group. The 
scheme would certainly be complex; and there is no a 
priori reason to assert, for example, that even a million 
categories would enable one to predict that a particular 
ten-year-old boy, let us say, will develop angina pectoris 
between the ages of forty-five and fifty-five, or that a 
healthy young woman will have a cervical cancer before 
she is sixty. 

The existence of recognizable hereditary characteris- 
tics, the small number of clinically distinguishable blood 
types, the “tendencies” or “predispositions” known to 
practicing physicians, the experience of plant and ani- 
mal breeders, the development of strains of laboratory 
mice like waltzing mice or those who invariably develop 
cancer, and research in heredity ( e.g ., on Drosophila or 
Neurospora), and other examples, all suggest that very 
useful classifications probably exist whose numbers of 
categories are not astronomically large. If a number of 
the order of ten thousand categories could adequately 
characterize an individual biologically, such a charac- 
terization would be of tremendous use in preventive 
medicine. Who knows the extent to which cancer and 
heart trouble could be anticipated, and perhaps pre- 
vented or corrected or ameliorated by proper control of 
diet, activity, or environment? The Rh factor which 
used to kill babies born to mothers of opposite Rh blood 
type is now understood well enough to predict the 
couples to whom this would happen. The tragedy is now 
avoidable. Who can say what anguish and burdens 
could be prevented if a similar understanding of con- 
genital idiocy and other genetically based abnormalities 
or inferiorities could be achieved? Who knows but that 
the patterns of potential parents of future Einsteins 
might become recognizable? 
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Such knowledge, like all knowledge, would be a 
double-edged sword. Unattractive Orwellian prospects 
suggest themselves under totalitarian regimes. But 
under a form of government in which the sanctity of 
human personality is preserved, one sees the possibility 
of great good, of healthier individuals, and of a grad- 
ually improving human stock. The concept of eugenics 
is basically good, and if the individual is free to use eu- 
genic information or not, if he is never coerced, and if 
the program is kept free of bias and fanaticism, only 
good can come of it. 

Setting up such a program would ultimately require a 
tremendous amount of experimental data, data process- 
ing, analysis, and testing of statistical hypotheses. Years 
might pass before results with practical application 
could be obtained (though we doubt it), and automation 
of hundreds of complex laboratory procedures, many 
not now known, might be necessary before people could 
be adequately “typed.” But if a program developed 
capable of coming anywhere near the goals described it 
would bring benefits even greater than those discussed. 
The old problems of nature vs nurture, or heredity vs 
environment as determinants of human capability and 
achievement might become better understood. Factors 
leading to more valuable and satisfying lives might 
stand out in bolder relief if the life patterns of a large 
number of biologically similar individuals were studied 
with the tools of psychology, sociology, and anthropol- 
ogy. A far deeper understanding of psychosomatic in- 
teractions would almost surely result. Cross-cultural 
studies, as part of the broad program, might give deeper 
insight into the interactions between individual psychol- 
ogy and the cultural milieu, perhaps to enable us to see 
how the values and goals of different cultures (which 
have much to do with the stresses and motivations of an 
individual) generate different statistics of stress syn- 
dromes, diseases, antisocial behavior, or mental condi- 
tions in biologically similar individuals. Could such 
knowledge be obtained, we could begin to develop a 
science of society which would permit conscious improve- 
ment of our customs, values, and motivations, pro- 
vision of sanifying influences, and tend to maximum 
development of individual talents and satisfaction. 
These utopian dreams, distant as they may now seem, 
need not be unattainable. With computer techniques, 
automation, and the devoted labors of inspired, deter- 
mined, and creative people, such dreams will come ever 
closer to reality. 

In these times of international tension, omnipresent 
powder kegs and atom bombs, one can seriously main- 
tain that the world cannot afford to neglect these possi- 
bilities. 

Conclusion 

There is an old witticism about the difference be- 
tween a scientist and a philosopher. The former applies 


increasingly refined techniques to an increasingly spe- 
cialized and narrowing aspect of the world. As time goes 
on he knows more and more about less and less. The lat- 
ter continually generalizes, going into increasingly 
abstruse abstractions. As time goes on he knows less 
and less about more and more. In the end, the story 
goes, the scientist knows everything about nothing, 
while the philosopher knows nothing about everything. 
While neither scientist nor philosopher would admit ap- 
proaching the limits described, there is enough truth to 
the story to show that it might sometimes be good for 
the scientist to be a little more philosophical, and for 
the philosopher to be a little more scientific. 

It is hard, perhaps, for the computer scientist or en- 
gineer, wrestling with detailed problems of hardware, 
logical design, budgets, deadlines, maintenance, and 
operation, to take the philosophic approach very often. 
The viewpoint here espoused is that once in a while a 
broad philosophic look at things can be valuable. This 
paper has therefore studiously avoided technical minu- 
tiae and conventional computer applications in favor of 
bold (we hope not reckless) extrapolations and generali- 
zations to broad problems. What we dream today we 
achieve tomorrow. It is our responsibility not only to 
develop day-to-day technology, but also to philosophize 
enough to find fields where technology will promote ad- 
vances measured in broad human terms. Just as we 
have learned to dig better with steam shovels than with 
our fingers, so must we learn to tackle civilization’s prob- 
lems with the aid of computer techniques rather than 
with our bare brains. 

We cannot wait until the perfect master plan is 
worked out. Not only would this never be done, but 
even if some plan were set up as near perfect, it would 
probably be obsolete before it could be implemented. 
The same kinds of piecemeal attack that characterize 
all research would have to be employed. There is much 
that can be done immediately and in parallel (some be- 
ing done already), such as 1) mechanizing public health, 
hospital research, and individual physicians’ case his- 
tory data, 2) developing “common language” techniques 
to permit easy exchange and consolidation of informa- 
tion gathered by independent agencies, individuals, and 
countries, 3) making it possible to consolidate scattered 
data on individuals in order to amass birth-to-death 
histories, 4) creating techniques of integrated coopera- 
tion between individuals, institutions, local, state, fed- 
eral, and international organizations, 5) encouraging 
cross-disciplinary research, 6) sponsoring programs de- 
signed to uncover areas to work on immediately, 7) per- 
fecting routines for testing increasingly complex statis- 
tical hypotheses, 8) formulating special fields so that 
computer techniques can be used on them, and 9) doing 
this whenever it becomes feasible. We must press for 
the nine parts of action to support the one part of philos- 
ophy. 
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The Measurement of Social Change 

RICHARD L. MEIERf 


I S IT possible to build a synoptic instrument, similar 
to a telescope or a radar network, for viewing one’s 
own society? How may we interpret the myriads of 
social activities that are presently undertaken? Prelim- 
inary explorations suggest that we need sensing tech- 
niques, or transducers, that pick up changes going on 
inside the society. External indicators, like air photos, 
are too superficial. We are faced with a problem of dis- 
covering what operating characteristics a deep-prob- 
ing instrument should have so that it may be as practi- 
cal and useful as possible. 

Economists judge change in society by modifications 
in the makeup of the gross national product and the level 
of expenditures, political scientists can analyze elec- 
tions and polls, but sociologists and anthropologists 
have no cumulative sets of accounts or aggregate in- 
dexes. They have had hopes, however, similar to those 
expressed by Lazarsfeld [3 ] : 

Our economic statistics are today quite well advanced. We know 
how much pig iron is produced and how much meat is exported every 
year. But we still have very little bookkeeping in cultural matters. 
The content of mass media of communication is an important and 
readily available source of social data, and it will not be surprising if 
this analysis becomes a regular part of our statistical services in the 
not too distant future. 

These statistics have not yet come into being because 
the labor cost was high, the time lags were great, and the 
system description was incomplete, so it has been impos- 
sible to state how one set of measurements related to an- 
other. 

Let us take a brief, searching look at the social system. 
Society is maintained and changed by the behavior of 
its members. Intuitively one feels that the basic unit of 
behavior is the act, but acts are not as easily counted 
and differentiated as particles, molecules, or organisms. 
Satisfactory data can only be obtained when actors are 
forced to confine their behavior within certain preset 
specifications or codes, which may be called languages, 
currencies, habits, or “standard operating procedures.” 
This behavior must be observed in public spheres, since 
the objective, detached observer is missing in private 
affairs. The latter will require altogether different instru- 
ments and techniques for data accumulation, and will 
not be taken up here. 

By far the most promising attack upon the problems 
of measurability is offered by lumping together small 
sets of acts into transactions. A social transaction in- 
volves, among other things, the emission of a message 

f Univ. of Michigan, Ann Arbor, Mich. 


together with evidence of its receipt — apparent to an 
observer, but also, through one or another form of feed- 
back, to the agency responsible for emission. 

At any given moment, the population of the society 
can be divided into senders, receivers, and nonpartici- 
pants, much as the economist divides his population 
into producers and consumers, and each participant 
must play both roles. The message normally contains 
some information that is novel to the receiver, more that 
is redundant, and some symbols that are quite unintelli- 
gible. Some messages are not communicated directly to 
receivers , but are stored in libraries, files, and artifacts 
where they become a resource embedded in the social 
environment. Uncoded information may be gleaned 
from the environment through systematic observation. 
Scientists, weather observers, diagnosticians, and other 
professionals have been trained to reduce these phe- 
nomena to coded, communicable form (Fig. 1). The 
information flows should be sampled where the wavy 
lines occur. 


Observation of the 
Social and. Physical 
Environment 



Environment 


Fig. 1 — Communications flows in society. 

The term “information” at this point has been used 
in its intuitive sense. At a later stage, it will be shown 
that the demand for information storage (used now in 
its technical sense) in our instrument corresponds 
crudely with the volume of these flows in society — about 
as well as national income figures represent the com- 
bined satisfactions of consumers. The greatest diffi- 
culty in the design of our instrument is the conversion 
of all of the codes for human communication, oral, writ- 
ten, graphic, gestural, musical, etc., into a single code 
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which is convenient for machine handling. We may have 
to incorporate human operators, whose skills resemble 
those of cataloguers in a library, for the more difficult 
features of translation. Fortunately, as we shall see 
later, the bulk of the information flow in modern society 
is in the form of printed language, which seems amen- 
able to automatic sensing, coding, and abstracting 
(Luhn [4]). 

Given the complexities in social communications, 
how would a representative and comprehensive over- 
view of the social transactions be obtained for our instru- 
ment? The mass media — television, radio, magazines, 
newspapers, books, records, catalogues, direct mail ad- 
vertising, etc. — could be recorded at the source. Schools, 
conferences, committee meetings, shop talk, live per- 
formances, etc., would have to be random-sampled. But 
on what basis? 

Here we are forced to refer to a fundamental property 
of modern society. A sender can have many simultane- 
ous receivers , but any given receiver usually accepts mes- 
sages from but one sender at a given moment; on rare 
occasions he may pay attention to two or three, but no 
more. The decision as to the completion of a social 
transaction depends upon the receiver. He pays for the 
message by spending time taking it in. Human time is a 
moderately scarce commodity, and cannot be wasted in- 
definitely. People tend to switch dials and scan the 
newspapers and magazines until they find messages that 
are interesting to themselves. Message types that gain 
few receivers tend to be dropped by senders in favor of 
those which get more attention. Therefore, the social 
value of broadcast messages may be determined, as a 
first approximation, by the amount of time people de- 
vote to them. 

Thus a comprehensive time budget of the members of 
the society — how they allocate time to receiving mes- 
sages of various kinds, and time to other matters that do 
not involve social communications — provides a simple, 
additive criterion of value. We could attach the probable 
number of receivers, with some estimates of the time and 
place of reception, to the records of the messages them- 
selves that are held in storage for our instrument. 

The formulation of a society-wide time budget has 
already been explored by Meier [5]. A quantitative de- 
scription of time-use has applications in public affairs 
independent of its employment in our system, and the 
techniques required for making economical measure- 
ments already exist. 

Economists have found that macro-analysis is greatly 
assisted by subdividing the economy into such sectors 
as agriculture, manufacturing, households, etc. The 
rules for simplifying the accounting may be different in 
each sector. The choice of sectors in social analysis will 
depend upon the kinds of reinforcement provided by 
other approaches to social measurement, such as public 
opinion evaluation, the Census, and historical analysis. 
A first guess regarding sectors is provided in Table I. 


TABLE I 

Communications-Oriented Allocation of Time 
in Public Activities 


Work 

Travel 

Play 

School 

Reading 

Ritual 

Radio and TV 

Meetings and Parties 

Personal Services 

Shopping 

Dining and Drinking 

Miscellaneous 


Possible Subcategories under Work 


Factory 

Housework 

Office 

Maintenance of Property 

Construction and Mining 

Services 

Agriculture and Forestry 

Miscellaneous 


Another feature of our instrument must be intro- 
duced. If it is to be economically constructed, it should 
be decentralized. The headquarters would contain com- 
munications which are subject to national distribution, 
plus some measurements of exports and imports over 
continental boundaries, while branches would exist in 
every metropolitan area (Fig. 2). 

In the course of proposing a design for this apparatus, 
we have piled feature upon feature so that it has by now 
become quite elaborate. It is expected to intercept and 
store a huge volume of messages, but this is made feasi- 
ble by eliminating most of the redundancy in social com- 
munications, and reducing all the messages to a com- 
mon code. The instrument attaches weights to these 
messages according to the number of persons and the 
amount of time spent receiving them; it indicates the 
times and places the message is received, and it must 
store all of this in a permanent record which can be 
scanned quickly and automatically. Fortunately it need 
not get every message that is received, but may start 
modestly by sampling at, say, a one per million rate. As 
the instrument is refined, and the representation of so- 
cial change that is required must be finer-grained, the 
sampling rate may be advanced. 

We are now ready to discuss who would use such an 
instrument and for what purposes. Planners and admin- 
istrators who must make decisions for the public regard- 
ing parks, playgrounds, schools, traffic patterns, and 
various social services should be able to develop criteria 
for deciding from studies of trends in social communica- 
tions and from comparisons with other sources of social 
data. Advertisers may be expected to develop their 
craft on the basis of the more detailed measurements of 
response they would be able to obtain. Politicians should 
be able to sense better the distribution of sentiment on 
various issues. Educators may assess the impact of 
special programs. Changing tastes, the appearance of 
new patterns of social interaction, and the passage of 
fads should all be registered as factual data — “how 
much,” “where,” and “when.” The natural emphasis is 
upon local public affairs, mass entertainment, and the 
functioning of work, school, and commerce, because 
these matters make up the bulk of our communications 
activity. 
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Nationwide Network 
Distribution 


Government and 
Business Operations 


Television 

Radio 

Films 

Magazines 

Books 

Markets 



Correspondence 

Reports 

Directives 

Catalogs 

Educational Materials 
Mail Advertising 



Foreign Imports > CENTRAL STORAGE 



Local Radio 
Local TV 
Newspapers 
Regional Mags. 


METROPOLITAN STORAGE POINTS 



Wholesale Markets 
Public Events 
Conventions 
Universities 


Schools 

Meetings 

Churches 

Shopping 


COMMUNITY ACTIVITIES 


Restaurants 

Bars 

Sports 

Visiting 


Fig. 2 — Organization of social communications as imposed 
by the location of various activities. 


Time 



Fig. 3 — The “trial balloon” stimulus as revealed by content analysis. 
Assume the stimulus contains concepts whose treatment in com- 
munications uses terms A, B, and C with high probability. 


A skilled operator would ask his questions in terms of 
key words or phrases appearing in the content with a 
frequency of 10~ 6 to 10~ 8 . They serve as “tracers” of 
message content as it is spread through the population. 
Maps and time series can be prepared which show their 
buildup and decline. More detailed information about 
the changing attitudes of people may be obtained by 
reconstructing the contexts within which the key words 
appeared. 

The severest criticism to be made of a representative 
record of social communications is that the content of 
the messages tends to be superficial. In many, if not 
most, social transactions people disguise their true feel- 
ings about a subject. An investigator may nevertheless 
make many nontrivial observations, and can probe 
more deeply, if he desires, by using the “trial balloon” 
technique (Fig. 3). An event, closely relevant to the 
subject of interest, is purposely created — it may be an 
announcement, an incident, or a rumor. The subsequent 
wave of “talk” that is stirred up may then be analyzed. 
The effect that is triggered off provides a good indica- 
tion of the sensitivity of the public to that issue at that 
time. 

These and other small-scale tactical uses in govern- 
ment and commerce should grow rapidly to the point 
where large installations may be justified which allow 
hundreds of simultaneous operators. 

The strategic uses of an instrument of this sort are 
still more interesting. The accumulation of socio-cul- 
tural “wealth,” for example, may be estimated in a 
manner analogous to that developed by economists, and 
the flows of information through society may also be 
estimated. A very brief outline of the steps involved, 
and the kinds of conclusions to be obtained, will be pre- 
sented. 


What is the total of all nonredundant information 
that is transmitted in society for a year? The limitation 
upon flow is the capacity of the receiver to understand 
the messages to which he exposed himself. A receiver has 
a limited repertory of terms. Reasonably good statistics 
exist only for English vocabulary. 

The respective terms that are used in messages can 
be mapped according to their probability of occurrence 
in social communications, as in Fig. 4. The abscissa is 
some arbitrarily defined categorization of meanings, 
similar to the Dewey decimal system. When this same 
map is put onto polar coordinates, we can show stages 
in the development of a receiver as in Fig. 5. The pro- 
tuberances on the periphery are associated with the 
specialties engaged in by the person. The map of transi- 
tion probabilities between terms would have the same 
appearance. 

There is a standing rule in society that a sender should 
have greater knowledge about the subject of the mes- 
sage than the receiver, if information is to be transmitted. 
Thus, on the average, the senders are more informed 
and more expert than the receiver, as shown in Fig. 6. 
Continued communication would cause the receiver's 
repertory of terms to grow in the direction of the 
sender's. He would learn something about the subject. 
Senders must choose their terms so that they lie on 
the periphery of the receiver's map, if they are to save 
time and maintain interest. 

We are now in a position to estimate the amount of 
information flowing that is potentially useful to re- 
ceivers. Let us define a restricted number of classes of 
receivers, say about a hundred, each representing a dif- 
ferent segment of society, ranging from illiterates to 
various kinds of professionals, but exhaustive of the 
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negative 
logarithm g 

of 

frequency 3 

of symbolic 4 

terms used, 
in social 3 

coommicatlons 2 

I 



The levels may he 
obtained through 
vocabulary tests ( 
completion tests, 
etc. They are 
excellent Indices 
of Intellectual 
achievement In the 
respective direc- 
tions. 


Categories of social communications, spaced according 
to the allocation of the volume of transmitted terms 
to each of them respectively. 


In American culture A might include terms used In 
newspapers, magazines and popular books, B those 
used in conversation, radio, and television, C those 
employed in business transactions ... 3 those used 
in the fine arts, etc. 


Fig. 4 — Typical properties of a receiver of social communications. 




(a) 



(b) 

Fig. 6 — Relationships between repertories in social communications, 
(a) The receiver expands his map in the indicated direction as a 
consequence of communication, (b) Simplified maps showing how 
the sender chooses terms which lie on the boundary of the vo- 
cabulary that is shared, if he wishes to optimize the transfer of 
information. 


feet, and directing has this function for mass media. 
The pauses unconsciously inserted into human speech 
have recently been shown to work on the same principle 
in. This property, combined with miscellaneous other 
available information about mass public behavior in 
American metropolitan areas, enables us to arrive at the 
first approximation of information flow (Table II). 


TABLE II 

Information Transmission in Metropolitan Society* 
(Population 5,000,000) 


Fig. 5 — Typical development of the vocabulary map in an individual. 
The powers of ten shown are levels for the frequencies with which 
terms appear. If the Zipf distribution (rank order times frequency 
is a constant) holds, each shell contains the indicated number of 
terms. Categories or contexts (A, B, C, • • • K) are assigned by 
convention as before. 

population. Each would have a distinctive map. The 
messages transmitted in society and stored in our social 
record must have the receivers which choose to spend 
time on them classified according to category. Shannon 
[6] has described a method for using typical receivers 
for the measurement of redundancy. 

Interestingly enough, the significant information 
flow rate tends to stabilize itself for a given context. 
Editing the rough spots out of manuscripts has this ef- 


Mode of 
Reception 

Time Allocated 
hours/year 

Estimated 
Receiving Rate 
bits /minute 

Estimated Flow 
bits/year 

Reading 

4X10 9 

1500 

36X10 13 

Television 

3X10 9 

500 

9X10 13 

Lecture and 




Discussion 

4X10® 

200 

5 X 10 13 

Observation of 




Environment 

3X10 9 

100 

2 X 10 13 

Radio 

1.5X10® 

300 

3X10 13 

Films 

1.6X10 8 

800 

8X10 12 

Miscellaneous 

5X10® 

100 

3 X 10 13 




6 X 10 14 


per capita average~10 8 bits/year 


* Judged in terms of the probable repertories of receivers, not in the 
accepted sense used in information theory. Possibly a new term should 
be coined for information distributed over a population. 
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Comparisons between the poorest and richest metro- 
politan areas can also be exceedingly suggestive (Table 
III). Observers now agree that socio-cultural growth 
parallels economic growth but the introduction of 
measurements suggests that social communications must 
either precede economic growth or grow more rapidly 
than income. Apparently an expansion of socio-cultural 
activity is a necessary but not sufficient precursor of 
economic development. 


TABLE III 

Income and Information Flow Extremes in Urban Society 



San Francisco 

Addis Ababa or 
Jakarta 

Income 

$3000 capita/year 

$150 capita/year 

Non-redundant* 



information receipt 

~10 8 bits/year 

~10 6 bits/year 


* Again in terms of probable repertories of receivers. This assumes 
that 70-80 per cent of residents in the poorer cities are illiterate. 


The heavy volume of information transmitted by 
reading is highly significant. A society like our own 
which is increasingly white collar reads more at work 
and at home. There are limits to human ability to re- 
ceive information, however, which are believed to be in 
the neighborhood of 10 9 bits per capita per annum for a 
population with the present distribution of mental ca- 
pacities. At the present estimated rate of gain, this 


theoretical saturation level is likely to be reached within 
two generations. The prospect is startling enough to 
cause us to investigate more closely the stresses associ- 
ated with communication saturation in human organ- 
izations. 

My feeling is that our instrument is already techni- 
cally feasible. Simple calculations show that sampling 
social communications at a rate of ten parts per million 
presents storage requirements within range of existing 
equipment, but the desired degree of access remains 
unclear. Message collection in the field is not a prob- 
lem, but the programming for storage and the catalogu- 
ing of nonverbal materials has been inadequately de- 
veloped. Much experimental work and formal analysis 
will be required before a truly comprehensive cross sec- 
tion of social change can be achieved. 
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Simulation of Sampled-Data Systems Using 
Analog-to-Digital Converters 

MICHAEL S. SHUMATEf 


Introduction 

U NTIL recently, systems simulation problems 
could generally be split into two classes : problems 
requiring only an analog computer for solution 
and problems requiring only a digital computer for solu- 
tion. Any general problem has a number of character- 
istics which adapt it to either one or the other method 
of solution. 

A systems problem is most readily adapted to analog 
computer simulation when it requires a relatively short 
solution time on the computer and a relatively inac- 
curate solution is acceptable; has relatively “high” fre- 
quencies; and has nonlinearities such as, for example, 

f Consultant for Space Technology Labs., Inc., and California 
Inst, of Tech., Pasadena, Calif. 


saturation, deadzone, or hysteresis. To be adapted to 
digital computer simulation, a systems problem usually 
possesses relatively low frequencies and requires a long 
solution time; can be adapted to an iterative form of 
simulation without introduction of an instability (this 
is usually implied by a lack of nonlinearities such as 
those mentioned above) ; and has a range of variable 
which exceeds that possessed by an analog computer 
solution. 

Certain problems involving combinations of both 
groups of properties may often be split into two sep- 
arate problems, one involving high-frequency nonlinear 
effects, and one involving low-frequency effects. 

An example of such a problem is the simulation of the 
flight of a liquid-propelled ballistic missile. The missile’s 
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reactions to external and internal forces can be simu- 
lated on an analog computer, and the motion of the 
missile along a trajectory can be simulated on a digital 
computer. 

However, recent developments in control systems, 
particularly systems using sampled or discrete data, 
have generated a third class of simulation problems; 
this class of problem usually involves both sampled and 
continuous information, has nonlinearities and no gen- 
eral dividing line between high and low frequencies, and 
usually has a range of variable which exceeds that pos- 
sessed by an analog computer. An example of such a 
problem is the simulation of an inertially guided missile. 
The guidance and control systems must have rapid re- 
sponse characteristics, and the system may involve a 
special-purpose digital computer, whose program makes 
it act like a multimode, adaptive controller. 

Simulation of Sampled Data Systems 

Large analog simulation facilities are thus faced with 
the problem of obtaining reasonable representations of 
such systems, and must thus have the capability of at 
least sampling continuous information, and hopefully 
be able to perform some intelligent and useful opera- 
tions on such information. 

Several straightforward methods of performing sam- 
pled-data simulation are presently available. Sampling 
may be accomplished by a so-called hold-amplifier [see 
Fig. 1(a) ] or a passive hold circuit [see Fig. 1(b) ]. These 
circuits are not easily adapted to simulation of digital 
computer operations, because of difficulty involved in 
holding past values of sampled information. This diffi- 
culty is partially eliminated by using transfluxors 1 as 
hold devices. 

The most general method for simulation of sampled- 
data systems is to use analog-to-digital converters to 
connect an analog computer to a general-purpose digital 
computer (see Fig. 2). In such an installation, one or 
more analog-to-digital converters are used to sample 
continuous information and present it to a digital com- 
puter. Several more analog-to-digital converters are 
used open loop as digital-to-analog converters to present 
and hold digital information to the analog computer. 
Two major disadvantages of this system are: 

1) The difficulty in starting the two computers simul- 
taneously, and 

2) The difficulty in scheduling operation time on the 
digital computer, since in most installations digital 
computing time is a premium quantity. 

The first of these difficulties may be remedied by 
equipping the digital computer with an “interrupt” fea- 
ture, and the analog computer with a “start” control. 
The interrupt feature allows the digital computer to 

1 J. A. Rajchman and A. W. Lo, “The transfluxor,” Proc. IRE, 
vol. 44, pp. 321-332; March. 1956. 
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Fig. 1 — Two examples of sample-and-hold circuits. 



Fig. 2 — General-purpose sampled-data simulator. 


break its program when a pulse (perhaps a sample 
pulse) is received, and jumps to a special subroutine 
used for external communication purposes. The start 
control allows the analog computer hold relays to actu- 
ate simultaneously with the same pulse that first inter- 
rupted the digital computer. 

The second difficulty is easily overcome if one is will- 
ing to pay for the digital computer time. It seems a little 
ridiculous, however, to use a large general-purpose 
digital computer if the digital program involved is rela- 
tively simple. For example, in order to use this system 
to simulate a sample-and-hold, the digital computer 
must be programmed to perform a transfer of a number 
from an analog-to-digital converter to a digital-to-ana- 
log converter, and would thus sit idle during most of a 
sample period. 

It therefore becomes evident that such a large, gen- 
eral-purpose sampled data simulator should only be 
used for complex, sophisticated problems, and some 
auxiliary equipment should be developed to simulate 
simpler sampling operations. 
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Equipment 

The construction of an auxiliary sampled-data simu- 
lator is, of course, dictated by equipment present in a 
simulation facility. The facility currently available to 
the author consists of a 300 amplifier Electronic Associ- 
ates Analog Computer, an EPSCO Addaverter conver- 
sion system, and a Remington- Rand Uni vac Scientific 
Digital Computer, Model 1103A. 

The approach used in the construction of an auxiliary 
sampled data simulator was based on the fact that the 
facility currently possessed equipment which would 
both sample analog voltages and hold analog voltages. 
This equipment, the Addaverter (see description be- 
low), would, of course, be in use part of the time as a 
communication link to the 1103A, but its time schedule 
was sufficiently open to warrant thinking of using it for 
other purposes. 

The Addaverter is used to simulate a group of parallel 
sample-and-hold channels. A sample-and-hold is ob- 
tained by effectively placing an analog-to-digital con- 
verter in series with a digital-to-analog converter. The 
analog-to-digital converter is caused to sample its input 
voltage, the resulting digital number is then transferred 
to the digital-to-analog converter, and converted back 
into a voltage, which remains unchanged until the com- 
plete cycle is repeated again. Note that delaying of the 
conversion of the number back into a voltage is equiva- 
lent to delaying the sampled values of the input voltage 
and is useful for simulation of transportation lag, etc. 

In order to obtain a better description of the imple- 
mentation of the above concept into a complete sampled 
data simulator, some description of the Addaverter is 
necessary. 

The Addaverter consists of 30 analog-to-digital con- 
verters (hereafter abbreviated ADC), 15 used as such, 
and the other 15 used open loop, as digital-to-analog 
converters (abbreviated DAC). 

All 15 ADC’s sample 2 the voltages present at their 
inputs simultaneously, under the control of a central 
sample controller, which is triggered by a single pulse, 
called a sample pulse; the resulting digital numbers re- 
main intact in the memory of each ADC until the next 
sample pulse occurs. 

The DAC’s operate individually, each converting the 
digital number previously stored in its memory to an 
analog voltage when it receives a pulse, called a present 
pulse. The voltage at a DAC’s output remains un- 
changed until a new number has been read into its 
memory and a present pulse has been received. A single 
present pulse input and 15 pulse inhibit inputs are used, 
instead of 15 pulse inputs. 

In order to transfer the digital numbers from the 
ADC memories to the DAC memories without using the 
digital computer, an additional piece of equipment, nor- 

2 The Addaverter uses the “ripple-down” method for analog-to- 

digital conversion and requires 180 Msec to complete one conversion. 


mally used for Addaverter maintenance purposes, is em- 
ployed. One particular operational mode of this equip- 
ment, when triggered by a single pulse, transfers the 
numbers stored in the ADC memories into the DAC 
memories in a sequential fashion : the content of the first 
ADC memory is transferred into an auxiliary register, 
and then into the memory of the first DAC ; 3 the process 
is then repeated for each of the other 14 ADC-DAC 
channels. Provision is made to prevent the number 
transfer for any individual channel by energizing an in- 
hibit gate associated with that channel. 

These three units, the 15 ADC’s, the 15 ADC’s used 
as DAC’s, and the number transfer equipment, com- 
prise the basic sampled-data simulator. In order to ob- 
tain satisfactory operation, it is necessary to supply a 
burst of three pulses each time it is desired for the sim- 
ulator to sample. Another piece of equipment was con- 
structed to supply the pulse burst necessary to control 
the simulator. This equipment consists of necessary con- 
trol logic for the simulator, and a large quantity of pulse 
and dc logic with a prepatch capability, to permit flexi- 
ble operation of the entire system. A source of timed 
pulses is available, to use as system clock; a “start” sys- 
tem is available, to synchronize starting of analog simu- 
lations with the clock pulses. 

An ADC-DAC channel may thus be used to sample- 
and-hold an analog voltage with either of two transfer 
functions: 


1 


B i0) = 

— (1 — e~ sT ), and 

(1) 


5 



1 


H 2 (s) = 

— (1 - e~ sT )e~ ST r < T 

(2) 




where T is the sampling period, and r a delay time. 

No problems are incurred if it is desired to have all 15 
ADC-DAC channels operating in the same mode. How- 
ever, it is sometimes desired to simulate multirate sam- 
pled-data systems, or monorate systems that have sev- 
eral samplers each with a different time delay. No diffi- 
culties would arise if each ADC-DAC channel operated 
independently of every other channel. This is not the 
case, however, since all channels sample simultaneously. 

The simulator control equipment was. so designed to 
permit simulation of up to four different sample-and- 
hold operations; however, their operating frequencies 
must be restricted to integral multiples of some fre- 
quency. 4 A block diagram of the simulator control is 
shown in Fig. 3. The simulator control has a set of 
sample pulse inputs, a set of present pulse inputs, and a 
set of “enable” outputs, all located on the prepatch 
panel. The inhibit inputs for the present pulse (DAC 

3 This process requires 40 nsec. 

4 This restriction is brought about by the simultaneous sampling 
of the ADC’s. It is possible to use two asynchronous frequencies, but 
some external pulse logic is necessary to prevent double pulsing of the 
sample control. 
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Fig. 3 — Block diagram of simulator control logic. 


Fig. 4 — Example pulse logic diagrams. 


inhibit) , and inputs for the transfer inhibit are wired in 
parallel 5 and brought up to the prepatch panel. The 
four “enable” outputs, one for each operating group, 
must be patched to the inhibit inputs of the channels, 
as predetermined by the programmer. If a particular 
channel is to operate according to the group 1 mode, its 
inhibit input must be patched to the group 1 enable. 
Several channels may operate from the same group en- 
able. 

A particular group’s operating mode is completely de- 
termined by what pulses are fed to its sample input and 
its present input. For example, to operate a group as a 
sample-and-hold [see Fig. 4(a)], a clock pulse must be 
fed to the group sample input each time it is desired 
for a sample to occur. A present pulse must then be 
fed to the group present input 815 //sec after each 
sample pulse. The sample input pulse causes all group 
enable outputs to be reset, then the particular group 
enable (whose input was pulsed) is set, and the ADC’s 
are caused to sample (all 15 sample) ; 180 //sec later, the 
number transfer is initiated. (Only those channels asso- 
ciated with the operating group have numbers trans- 
ferred.) The present input pulse, which must be delayed 
at least 815 //sec to allow the sample and number trans- 
fer to be completed, then causes the DAC’s connected 
to the operating group to present the numbers trans- 
ferred as voltages. 

Note that the sample-and-hold is actually a sample- 
and-hold plus a short delay; this delay is not long 
enough to introduce unwanted effects in the majority of 
problems simulated. If it is desired to introduce more 
delay, it is only necessary to increase the delay of the 
group present input pulse. The maximum delay possible 



nun. delay min. delay min. delay 


Fig. 5— Wiring of sample-and-hold channels to 
simulate long time delays. 

for a group present input pulse is the sampling period 
for the particular group. 

To operate two groups at different, related fre- 
quencies, the group running at the higher frequency is 
operated as described above, but the clock pulses for 
the lower frequency group must be obtained by divid- 
ing the higher frequency clock pulses by some integral 
number. [See Fig. 4(b).] The start control input is 
shown in Fig. 4(a) and 4(b). 

A great many other modes of operation are possible, 
but will not be presented here because of space restric- 
tions. 

Simulation of Simple Digital Computer 
Operations 6 

Simulation of sampled-data systems which incor- 
porate a digital computer may be accomplished by us- 
ing sample-and-hold channels to simulate the digital 
computer’s transfer function (provided the transfer 
function is not too complicated). Simple digital filters, 
second- or third-order difference equations, etc., may 
be simulated. 

If a group of ADC-DAC channels are wired in series 
[the output of the first wired to the input of the second 
(see Fig. 5, with the control logic being pulsed by logic 
circuitry given in Fig. 4(a) of the example logic dia- 
grams, the variable delay set to 815 //sec, and all chan- 
nel inhibit inputs wired to the operating group enable 


6 Logically, if a number is transferred into a DAC, it is done so 
with the intention of presenting it. 


6 J. R. Ragazzini and G. F. Franklin, “Sample-Data Control 
Systems,” McGraw-Hill Book Co., Inc., New York, N. Y. ; 1958 
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output], then it can be easily shown that successive 
channel outputs are each delayed from the previous 
channel output by one sample period. 

Erf in this treatise will be used to denote the sampled 
and held value of E ; the subscript n refers to the nth. sam- 
ple pulse. 

Suppose E is zero, and has been for a considerable 
length of time. Then E n *, E n - 1 *, etc., will all be zero. 
Now suppose that during the interval between the nth 
and the wth + 1 samples, E changes to some nonzero 
variable value. Then, after the present pulse associated 
with the wth + 1 sample, channel l’s output will change 
to E n+ *. However, during the sampling associated with 
the wth + 1 sample, channel l’s output was still zero, 
hence the output of channel 2 will remain at zero after 
the wth + 1 present pulse. 

After time n-\- 2, the output of channel 2 will have the 
value E n+ i*, since that value was still at the output of 
channel 1 when the nth -{-2 sampling occurred. The out- 
put of channel 3, E n *, will still be zero. (This operation 
is a consequence of the fact that each sample-and-hold 
channel has a slight transportation lag.) 

Therefore, a series string of sample-and-hold chan- 
nels will act like a delay line which propagates values of 
E that have been sampled and held. Further considera- 
tion will show that this is equivalent to the way a digital 
computer would remember past values of a variable. 

Thus 


Output of channel 1 = E n * 


Solving for E*, 

E 0 * = a 0 Ei* + 


( 6 ) 


The block diagram for this is shown in Fig. 6. The 
implementation of a second-order difference equation 
can be accomplished with three sample-and-hold chan- 
nels instead of four; the diagram in Fig. 6 was chosen 
because it made the error analysis more expedient. 


Accuracy of Simulation 

Each Addaverter unit (ADC or DAC) is accurate to 
within ±0.1 per cent of its nominal input (or output) 
voltage. Therefore, the voltage out of each sample-and- 
hold channel is accurate to within 0.2 per cent of the 
voltage put into the channel. Because of the 0.2 per cent 
inaccuracy of the Addaverter, some uncertainty in the 
solution of a difference equation may arise. The follow- 
ing discussion treats the limitations caused by this inac- 
curacy. 

A sample-and-hold channel may be visualized as hav- 
ing its output made up of the sum of a voltage which is 
identical to the voltage at the input during the last 
sample and an unknown random voltage. This is graphi- 
cally explained in Fig. 7. 

Using Fig. 7 as a model sample-and-hold channel, the 
complete diagram for a second-order difference equa- 
tion is shown in Fig. 8. 

The expression for the output is 


Output of channel 2 = E n *e~ sT 
Output of channel 3 = E n *e~ 2sT . 

Using z transform notation (s = e sT ; see Ragazzini and 
Franklin 6 ) 

Output of channel 1 = E n * 

1 

Output of channel 2 = E n * — 

z 

1 

Output of channel 3 = E n * — • 

z 2 


The first channel has the transfer function 


G 1 = — (1 - *-•*) (3) 

s 


and each succeeding channel has the transfer function 


E 0 * = Ei*G(z) + RCG(z) + Rn* 


ai 


b\ b% 
1-1 b — 

z z l 


bi + 


-Rm* 


z 


b i bi 

id h — 

z z 2 


-RiC 


b i b-j. 

1 q 1_ 

z z 2 


(7) 


Further analysis would be impossible without some 
simplifying assumptions about the character of Ri. 

An Ri is a function of the voltage at the channel input 
and hence cannot be assumed Gaussian. Observations 
taken from the Addaverter have shown that the Ri are, 
to a first approximation, constant offsets. Therefore, 
let 


Rf = 



( 8 ) 


G<i — Gi — 



(4) 


Suppose it is desired to simulate the transfer function 


Gif) = 


z(<z 0 z + a \ ) 
z 2 ±- b\Z T b% 


Eq* 


Ei* 


(5) 


where e< = constant offset associated with the 7th chan- 
nel. 

Furthermore, to make analysis expedient, the second- 
order difference equation is assumed to be that of a 
damped sinusoid: 

ze~ aT sin aT 

G(z) 

z 2 — 2 ze~ aT cos aT + e~ 2aT 


(9) 
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Fig. 6 — Block diagram for simulation of second- 
order difference equation. 


Substituting (8) and (9) into (7), 

Eq* = E l *G(z ) d e\G(z) 

z — 1 


E 

Perfect 

+ / 


Hold 

( 


E*+R* 

■O 


Fig. 7 — Model of a noisy sample-and-hold channel. 



z z 2 e~ aT sin aT 

_| €n 

2—1 z 2 — 2 ze~ aT cos aT + e~ 2aT 

z 2z 2 e~ aT cos aT — e~ 2aT z 

d em — — 

z — 1 z 2 — 2 ze~ aT cos aT -f- e~ 2aT 


G <z) = , -2- : — - 

iT + b z + b. 

1 2 

Fig. 8 — Block diagram of second-order difference 
equation, including random disturbances. 


z e 2aT z 2 

eiv (10) 

z — 1 z 2 — 2 ze~ aT cos aT -f- e~ 2aT 

The first term of (10) represents the desired output of 
the simulation. The remaining terms represent the error 
caused by the offsets in the channel outputs. The error 
is split into two parts: a constant term and an oscilla- 
tory term. These two parts are evident in the partial 
fraction expansion of the error terms of (10). 


larity that they give the largest error: namely, suppose 
€m = -\-\pf and cry = —tyf where \{/f is the average channel 
offset. Furthermore, suppose the total error of simula- 
tion is to be no larger than 100 1 pf ( i.e ., the error- to-offset 
ratio is 100). 

Then substituting into (12) 


100 > 


2e aT \p f 

1 - 2e~ aT + e~ 2aT 


(13) 


_ , _ * [(e~ aT sin aT)(e i + e n ) + ( 2e~ aT cos ar)(e m ) - (e~ 2aT ) (em eiv)] 2 

Error of E 0 * — 

1 — 2e~ aT cos aT d - e~ 2aT z — 1 

’ { ( e~ aT sin aT)(ei) + (e~ aT sin aT)( 2e~ aT cos aT — e~ 2aT )(en) 

d- [(2g~“ T cos aT - e~ 2aT )(2e-« T cos aT) - r 2a2, ](e m ) 

— (2e~ aT cos aT — e~ 2aT )(e\ v)}z 2 

. -f- [( — e~ 2aT e~ aT sin aT)(e i d~ en) — ( 2e~ 2aT e~ aT cos aT)(em) (r 2 “ T )(em + eiy)]z, 

(1 — 2e~ aT cos aT e~ 2aT )(z 2 — 2ze~ aT cos aT -f- e ~ 2aT ) 


The first term of (11) contributes a constant error, 
which would be most noticeable if aT were small. If 
a = 0, then the error due to the first term would be 


error = 


(2e~ aT - r 2 “ r )em - e- 2 “ r eiy 


( 12 ) 


1 - 2e~ aT -)- e~ 2aT 
Suppose em and eiv are offsets which are of such po- 


Solving for e~ aT , 

e~ aT < 0.868. (14) 

e~ aT is the z-plane location of the double pole for the 
case when a = 0. Hence, if it is desired to have an error 
no greater than 100 times the average channel offset, 
then the real part of the pole locations, for small aT, 
must be less than 0.868. 
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Fig. 9 — Forbidden region of z plane for poles of 
a second-order difference equation. 


Now consider the case where a — 0. The major source 
of error in the solution comes from the second term of 
(11). This error is 


Similar calculations made for an error contribution 
which is not to exceed 10 times the average channel off- 
set, \pf, yield the limits 

e~ aT < 0.641, and 

aT > 25.9°. (18) 

A similar analysis for a first-order lag may be made 
by letting ai = b 2 = 0 and a 0 = l in (5). For an error-to- 
offset ratio of 100 

bipf 

100 > V- • (19) 

Solving for hi, 

bi < 0.99. (20) 

Hence, for a first-order lag, the pole should be located 
at points less than 0.99 for a ratio of 100. This means 
that this technique of simulating difference equations 
cannot be used to do integration (the above case, for 
instance, where &i = l). 


{ [ei + (2 cos aT — l)en] sin aT + [(2 cos aT — 1)(2 cos aT) — l]em — (2 cos aT — l)ery}z 2 


error = 


+ { — (ei + en) sin aT — (2 cos aT — l)em + eiy}z 

(2 — 2 cos aT){z 2 — 2z cos aT + 1) 


(15) 


By assuming that cos aT is approximately unity, and 
that €111=^/ and ery= — ; fo (as before) and neglecting 
(ei+en) sin aT , the numerator of (15) reduces to 

2(z 2 - z)p f 

which is the same as the numerator of the transform for 
cos aTn. Hence, this error contributes 

i Pf 

cos aTn 

1 — cos aT 

to the output of the simulation. If as before, it is desired 
that this contribution be less than 100 times the average 
channel offset, i p f , then 

100*, > ■ (16) 

1 — cos al 

Solving for aT, 

aT > 8.1°. (17) 

Thus, using (14) and (17), a region of the z plane can 
be enclosed, and dubiously named a forbidden region for 
the existence of poles of a second-order system. Fig. 9 
illustrates this forbidden region. 


Figs. 10 and 11 are illustrations of the type of solu- 
tion uncertainty caused by the sample-and-hold channel 
offsets. For both figures, an undamped cosine function 
was used and the average channel offset was 0.02 volt. 
For Fig. 10, the value of aT was 5°, locating the poles 
inside the forbidden region; for Fig. 11 aT =10°, locat- 
ing the poles just outside the forbidden region. The dif- 
ference in solution uncertainty is clearly illustrated. The 
same type and magnitude of uncertainty arises in a 
damped sinusoid solution. 

It is indeed unfortunate that the forbidden region ex- 
ists where it does, since it excludes the area of the z 
plane most commonly used. Time scaling may some- 
times be employed to shift system poles to more favor- 
able locations, but this often leads to difficulties in the 
analog computer portion of a simulation, since fre- 
quency must be scaled up. 

Conclusions 

Simulation facilities possessing general-purpose sam- 
pled data simulators in the form of a conversion system 
and a digital computer may require an auxiliary sam- 
pled-data simulator for simulation of simpler system 
problems. 
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Fig. 10 — Cos aTn.aT = 5°. Average channel offset = 0.02 volt. 



Fig. 11 — Cos aTn.aT = 10°. Average channel offset =0.02 volt. 


An auxiliary simulator which uses the main portions 
of the analog-to-digital conversion system satisfies 
most of the requirements that a sampled-data simulator 
must meet. It has the advantage that it uses the conver- 
sion system during otherwise idle periods. 

The simulator consists basically of a number of sam- 
ple-and-hold channels which, subject to certain inac- 
curacies, may introduce errors into a simulation. The 
magnitudes of these errors can be determined in cases 
only with prior knowledge of the complete system. 

In the simulation of digital transfer functions, the ef- 
fects of these inaccuracies on the representation of 
transfer functions may be analyzed for any given trans- 
fer function. Given some bound on the over-all error, a 
region of existence of the z-plane poles of the transfer 
function may be outlined. 
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FOXY 2: A Transistorized Analog Memory 
for Functions of Two Variables 

L. J. KAMMf, P. C. SHERERTZf, and L. E. STEFFEN f 


F OXY 2 is a high-speed function generator with 
two independent variables for use in analog com- 
puters and simulators. 

FOXY 2 stands for Function Of X Y, 2nd type. It is 
a high-speed transistorized version of FOXY 1, which is 
a relay device. 

The mathematical approach is shown in Fig. 1. The 
x, y plane is ruled in a grid and the value of z at each 
corner point is defined in the memory. 


f Convair, San Diego, Calif. 


For any point (x, y) the 4 surrounding corner values 
Za, Zb, zc, Zd are switched to the output circuit, and 
z(x, y) is obtained by so-called bilinear interpolation as 
follows: z" is obtained by linear interpolation between 
za and zb, and z' is obtained by linear interpolation be- 
tween Zc and Zd ■ Then z is obtained by linear interpola- 
tion between z' and z". 

The memory comprises a jack for each corner point 
and a set of fixed voltage jacks. To record a value of 47 
for Z 5 , 9 for example, corner jack 5, 9 is wired to voltage 
jack 47. To provide for wiring several corner jacks to a 
single voltage jack, the corner jacks are actually twin 
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jacks and chain wiring is used from a voltage jack to a 
first-corner jack to a second-corner jack to a third- 
corner jack, etc. 

There are 201 fixed-voltage jacks having values from 
— 100 through 0 to +100. 

The x, y plane is divided up by 21 rows and 21 col- 
umns of corner points, so that z is defined at 441 points. 

A bus is provided for each x and each y grid line. A 
corner-point voltage is switched out from the memory 
when both its x and its y busses are energized. To pro- 
vide for the simultaneous selection of all four corners of 
a square needed for interpolation within the square, two 
adjacent x busses and two adjacent y busses are ener- 
gized at the same time. Energizing x busses 3 and 4 and 
y busses 1 and 2 switch out the four voltages on corner 
jacks (3, 1), (3, 2), (4, 1), and (4, 2). 

Ambiguity is prevented by the even-odd segregation 
scheme shown in Fig. 2. All corner points in the x, y 
plane are classified as +, B, C, or D. Every square has 
one and only one corner of each classification, although 
not in the same relative positions. The switching circuits 
for each classification are kept separate. Thus, in the 
above example corner 

(3, 1) is class A 
(3, 2) is class C 
(4, 1) is class B 
(4, 2) is class D. 

The simultaneous operation of busses x 3 , x 4 , yi, and y 2 
connect one and only one corner jack to each of four 
memory-output wires, A, B, C, and D. 

A block diagram of the system is shown in Fig. 3. The 
x and y input voltages each go to a corner selector which 
energizes the appropriate grid busses. The grid busses se- 
lectively operate the 21X21 array of corner switches. 
These connect the corner voltages (z values) to the out- 
put interpolator, 4 at a time (the A corner, B corner, C 
corner, and D corner of a grid square). The corner se- 
lector circuits also send signals via the pre-interpolators 
to the output interpolator which then generates a 
weighted average of the four corner voltages to produce 
the output, z. 
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Fig. 2 — Even-odd corner classification. 
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Fig. 3 — Block diagram. 


The corner point switching system is shown in Fig. 4. 
Here the outputs of the corner selectors of Fig. 3 are 
designated matrix decoders. The grid busses are desig- 
nated Xi, x 2 , • • • , X 21 and yi, y 2 , • • • , y 2 1 . Each corner 
switch is an AND gate comprising a 2N521A transistor 
whose base is connected to a y bus and to an x bus by 
IN 116 diodes and to a bias voltage by a resistor. 

When both its busses are energized, a transistor is 
turned on and its corner-jack voltage is imposed on one 
of the four output wires A, B, C, or D. All of the A- 
corner switches are connected to output wire +, and 
corners B, C, and D are correspondingly related to out- 
put wires B, C, and D. Thus, when two adjacent x busses 
and two adjacent y busses are energized, a corner volt- 
age is imposed on each of the four output wires. 

Each corner selector contains an analog-to-digital 
converter in which each count corresponds to a 10-volt 
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Fig. 4 — 21X21 switching array. 


Fig. 6 — Z interpolator. 



increment in the independent variable. See Fig. 5. The 
counter energizes the matrix decoder which is a set of 
gates. These gates energize a different pair of matrix 
busses, for each state of the counter. 

Interpolation is performed by time division and sum- 
mation of the voltages on the A, B, C, and D output 
wires. Fig. 6 shows the interpolator circuit. Each circle 
containing a Ax represents a transistor switch which 
rapidly turns on and off, with its on percentage deter- 
mined by the position of x between the two x-boundary 
values of its square, and similarly for y Fig. 7 shows the 
time-division coder circuit. The chopped A, B, C, and D 
voltages are filtered and then summed by the output 
amplifier to produce z. 

The interpolation process in somewhat more detail is 
as follows: the x-input voltage is combined with fixed 
voltages switched by the A-D converter to produce the 
sawtooth pre-interpolator input shown in Fig. 5. The 
pre-interpolator converts this to the triangular form 
designated pre-interpolator output. This is fed to the 



time-division multiplier of Fig. 7, whose output is a pair 
of pulsing voltages whose on time corresponds to the 
magnitude of the pre-interpolator output voltage. The 
pulsing voltages are fed to the two transistor switches in 
Fig. 6. 

The interpolation equations governing the system 
are: 



( 1 ) 

( 2 ) 

(3) 
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A Time-Sharing Analog Computer* 

JOHN V. REIHING, Jn.f 


I. Introduction 

T HE transient behavior of physical systems is often 
studied by the use of electronic analog computers. 
If the system considered is characterized by a con- 
tinuous distribution of properties, the describing system 
equations are of the partial differential class. An anal- 
ogous set of equations, soluble on the analog computer, 
can often be formed by the application of finite differ- 
ence approximations. For example, a physical system, 
originally space and time dependent, can sometimes be 
sectionalized into a number of space segments and then 
described by a set of ordinary differential and/or alge- 
braic equations. Such sectionalization ordinarily results 
in a number of equation sets of similar form. 

Most often the computational approach to the sec- 
tionalized problem is to associate with each section a 
block of analog equipment. Each section block is usu- 
ally composed of identical analog computer compo- 
nents. The total system is then simulated by cascad- 
ing the section blocks. For multisection systems, as are 
required for rapid transients, equipment requirements 
increase as n, the number of sections, increases. As a re- 
sult, some problems cannot be accommodated by the 
analog computer facility. Further, the large number of 
potentiometers resulting from the sectionalized cascade 
solution increases problem setup time and the probabil- 
ity of operator error in the potentiometer-set-phase of 
setup. 

A time-sharing analog solution, described in this 
paper, replaces the cascade of similar circuits by a single 
circuit whose components are time-shared. In conjunc- 
tion with the actual computing elements of the time- 
shared circuit are circuits to provide time delay and 
timing functions. This sharing permits a reduction in 
equipment requirements permitting a smaller invest- 
ment in computing equipment for a given problem size 
or increased problem capacity over that available with- 
out time-sharing. 

II. Description and Method of Operation 
A. A Typical Problem and Method of Solution 

The description and method of operation of a time- 
sharing analog computer designed to solve a typical set 
of pressurized water, forced convection reactor core heat 
transfer equations follows. The machine to be described 

* This paper is an abstract of a thesis presented at the University 
of Pittsburgh for the M.S. degree. The author is indebted to Drs. 
J. F. Calvert, T. W. Sze, and D. J. Ford, all of the University of 
Pittsburgh, for helpful criticism. 

f Bettis Atomic Power Div., Westinghouse Electric Corp., Pitts- 
burgh, Pa. Operated for the U. S. Atomic Energy Commission by the 
Westinghouse Electric Corp. under Contract AT-ll-l-GEN-14. 


illustrates the principal features of time-sharing and the 
solution of the sectionalized reactor core equations may 
be considered a typical application. 

The set of differential-difference equations to be 
solved is given below and is shown, along with a sketch 
of the model, in Fig. 1. Coolant flow is assumed constant 
for the analysis: 1 

dT m Jt) r .. 

—A- = <*&(<) - '[r„,(<) - r„,(<)] (i) 


Jf l = ot 3 ’[T mk (t) - r„,(<) j - a/fr.,© - (2) 


„ /A A.0 + ZV0 

1 wW) ~ 2 

( 3 ) 

+ 

-y 

£ 

II 

( 4 ) 


The forcing functions for the kth section are the inlet 
coolant temperature Tift) and the heat flux c[k{t). The 
output coolant temperature is T 0 ft), and the average 
metal and coolant temperatures are T m ft ) and T Wk (t ), 
respectively. 

The block diagram of Fig. 2 illustrates the conven- 
tional, cascaded method of solution. Each of the n sec- 
tions is composed of the same computing elements. Fig. 
3 indicates, in block diagram, the solution of the equa- 
tion set by time-sharing. The substitution of a single 
time-shared circuit for the tandem string of circuits is 
evident by comparing these two diagrams. 

Fig. 4 shows a four-section analog circuit diagram for 
a time-sharing machine to solve the set under considera- 
tion. This analog circuit can be considered as a combina- 
tion of two circuits, i.e., an equation solving section and 
an auxiliary or service section. The equation solving sec- 
tion consists of integrators A and B, summer D, and the 
gain potentiometers with settings cti, a A , a% , af . Inte- 
grator A solves (1) for T m ft ) given the heat flux and the 
film drop. Eq. (2) is solved by integrator B for T Wk (t) 
by integrating the film drop and the coolant tempera- 
ture rise across the kth. section. Summer D produces the 
outlet coolant temperature T 0 ft) by solving (3). The 
auxiliary or service circuit, peculiar to this time-sharing 
machine, includes six special devices. The devices and 
their functions are as follows. 

1) Delay circuits Da, Db, Do, and Db receive, store, 
and discharge voltages at times determined by 
control signals. The delay circuits can be classified 
into two types by considering the nature of the 
signals upon which they operate. First are those 

1 See Appendix for a list of symbols. 
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Fig. 1 — Reactor core heat transfer equations and model sketch. 
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Fig. 2 — Reactor plant simulator with multisection 
reactor core tandem section computer. 
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which delay initial condition voltages, e.g., 
Da, D b . These signals are discrete in nature and 
the necessary delay may be discontinuous, i.e ., dis- 
crete sampling, storage, and discharge. Second are 
those which delay continuous voltages, e.g., D c , 
Dd . This second type is extremely difficult to re- 
alize economically, particularly when the time de- 
lays are long. For this reason all of the delays de- 
signed for this prototype computer are of the 
first type. The continuous signal delays are ap- 
proximated by smoothing operations performed 
on the discontinuous delays. 

2) The integrator operation control circuit changes 
the operational state of integrators (Reset, Hold, 
Operate) in response to command signals from a 
timing circuit. 

3) A heat input circuit provides the kth. section heat 
forcing function when the kth. section equations 
are being determined by the equation solving cir- 
cuit. 

4) Smoothing, gain, and phase inverting circuits per- 
form the functions their names imply. 

5) Gates Ai, A 2 , A s , A 4 , A 7 , As, and A 9 control the 
flow of signals. 

6 ) A timing circuit controls the sequence of opera- 
tions of the auxiliary devices and the main and 
time-sharing computer. Command signals from 
the timing circuit are shown as heavy lines in 
Fig. 4. 

Prior to the initiation of operation of the time-sharing 
computer, steady-state calculations are performed to 
obtain the initial conditions for all n sections. These 
initial conditions are denoted by T mk (o) and T Wk (o). The 
voltage analogs of these temperatures are stored in dis- 
continuous delay circuits Da and D B , respectively. 
These circuits consist of four tandem cells denoted by 
Ci through C 4 . The number of cells making up delays 
Da and Db does not necessarily equal the number of 
sections being simulated. The requirement is that the 
number of cells and the stepping rate yield a time delay 
of #Tij + (w— l)r s where n is the number of sections, r s 
the sampling time, and tr the reset time. For the subse- 
quent discussion delay Dc is assumed to consist of five 
cells permitting five samples in each sampling interval 
r s , i.e., p = 5. The time delay employed is t s for the first 
sample and t r -\-(P — 2 )t s /P for the four subsequent 
samples. If a continuous delay were to be used a delay 
of r s +Tij would be employed. 

The heat input forcing function circuit (shown in the 
upper left of Fig. 4) computes q±, &, q_z, and q 4 , and these 
variables appear at the inputs of gates Ai through A 4 , 
respectively. The outputs of these gates are multiplied 
to form the qu input line. 

Two additional engineering considerations remain. A 
sampling period is chosen to establish the rate of com- 


putation as controlled by the timing circuit. The choice 
of a sampling period is governed by the speed of the 
transient to be encountered, the degree of reactivity 
feedback via the temperature coefficient, the desired ac- 
curacy, the allowable machine running time, and other 
considerations. The sampling period is denoted by r s . 
The second consideration is the evaluation of the hot 
leg transport time r dh . This time fixes the number of 
tandem cells required in delay circuit Dd (output stor- 
age, Dd, could also be a continuous type delay, e.g., 
tape, if economic considerations permit). With T dfl 
established, the timing circuits are adjusted to provide 
such a delay. 

The operation of the time-sharing analog computer 
proceeds as follows. The computer integrators are set to 
Reset, installing initial conditions T mi (o) and T Wl (o ) at 
the outputs of integrators A and B. With gates A x and 
A 7 open and all others closed, the main and time-sharing 
computers are set to Operate condition. The circuit re- 
mains in Operate for r s seconds during which time the 
forcing function TiJJ) flows into the computer. Since 
gate A i is open, the heat flux presented to the circuit is 
q x . The output of summer D is, consequently, the analog 
behavior of T 0l (t ) for the period r s , i.e., the output water 
temperature transient of the first section of the four- 
section model during the sampling period r s . This output 
temperature transient is to become the input forcing 
function for section two of the model during the subse- 
quent operational period, and so provision is made to 
store discrete values of T 0l {t). Such storage is accom- 
plished by stepping the discontinuous delay circuit Dc 
at intervals during the initial r s seconds. Such a stepping 
action is caused to take place every r s /4 seconds by the 
timing circuit. The result of this action is the storage of 
five samples of T 0l (t) in delay circuit D c at the end of r s 
seconds. These five voltage analogs denoted by T 0l (o), 
r 0 l (l), T 0l ( 2), T 0l (3), and T 0 l (4) appear in cells C 5 , C 4 , 
Cs, C 9 , and C\ of Dc, respectively. At the end of r s seconds 
the main computer (external to the time-sharing com- 
puter) and the time-sharing computer are set to the 
Hold condition. Shortly thereafter, the initial condition 
delay circuits D A and D B are stepped placing conditions 
T m2 (o ) and T Wi (o) at the outputs of integrators A and B. 
Stepping delays Da and D B also causes the state of 
integrators A and B (at a time r s after the beginning of 
the transient) to be stored in C x of D A and D B . These 
analog voltages are the initial conditions required for 
the second complete cycle of computation. They dis- 
place, in the delay circuits, the “initial” initial condi- 
tions. The notation employed for these conditions is 
T mk ( 4) and T Wk { 4) where the parenthetical number de- 
notes the state of the variable after a time 4 r s /4 sec- 
onds. Then, with gates Ai and A 8 open and all others 
closed, the time-sharing computer is placed in Reset 
and shortly thereafter in the Operate condition. 

The input forcing function for the second section is 
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now the output of the first section as previously com- 
puted. This signal is introduced by stepping the Dc de- 
lay. Such stepping causes the discrete analog voltages to 
pass out of the delay, through the smoothing, gain, and 
phase inverting circuit, and into the computational 
circuit via gate A s . This same stepping of Dc causes the 
output transient of the second section to pass into the 
delay D c for storage and future use in the next cycle of 
computation. Again, the output transient is sampled at 
five points r s /4 seconds apart in time. The output tran- 
sient from the third section is obtained during the third 
r s interval of time by the same sequential process em- 
ployed in solving the section two response, and similarly 
for section four. 

At the end of the fourth r s second interval the con- 
tents of delay circuit Dc are five voltages representing 
samples of the outlet water temperature transient dur- 
ing the initial sampling period of the input water tem- 
perature forcing function. At this time, with the time- 
sharing and main computer in the Hold condition, the 
initial condition delays Da and Db are stepped placing 
r wi (4) and T Wl ( 4) upon integrators A and B. Further, 
gate As is closed and A 7 opened permitting the second 
r s interval of the inlet coolant temperature forcing func- 
tion to drive the first section computation when the 
computers are set to Operate. Gate A\ is opened and the 
time-sharing computer is set to Reset. Both computers 
are now placed in Operate and the second cycle of com- 
putation begins. The four section computations proceed 
as previously described. During the first r s seconds of the 
second complete cycle gate A 9 is open. This open gate 
permits the output transient from the first cycle to pass 
into the output storage delay circuit Do as the first sec- 
tion response to the second sampling interval displaces 
this information in storage device Dc. 

As the computation proceeds in a cyclic fashion the 
output storage Dd becomes filled with samples of the 
desired output coolant temperature transient. Each an- 
alog sample, spaced r s / 4 seconds apart in time, is stored 
in sequential order in the Dd device. The earliest (time- 
wise) voltage appears in the highest order cell and the 
latest voltage sample in the lowest order cell, i.e., the 
input cell number (Ci). As soon as sufficient r s second 
intervals have elapsed so that the sum of the intervals 
totals the hot leg transport delay r dh , the output stor- 
age delay begins to discharge the sampled output tran- 
sient. This output information is sent out in spurts, four 
section computation times apart. Each data spurt con- 
sists of five voltage samples spaced r s /4 apart in time. 
Such discrete data may be smoothed to convert to a 
continuous analog form. 

B. Feedback Considerations 

Time-sharing techniques must include provision for 
feedback signals such as the temperature feedback loop 
signal in the simulation of a reactor plant with a nonzero 
temperature coefficient of reactivity. The effect to be 
simulated can be described as 


= / j p Wdr„,(0] j . (5) 

Or if the temperature coefficient, K T c k , is assumed spa- 
tially constant 

d[8k TC (t)] = f[K T dT„*(t)] ( 6 ) 

where 

1 n 

Ta.ve{t) = ^ T Wk (t). (7) 

n *=1 

An exact summation process, as required by (5) or (7), is 
not possible with time-sharing techniques. This inherent 
limitation is so because the instantaneous behavior of 
the average water temperature in all n sections is known 
only during the computation of the final or nth. section, 
and then only if all previous T Wk (t) transients are 
stored. 

The circuit next described approximates T aYe {t) as 
given by (7). The method proposed is the repeated cor- 
rection of the average existing at the start of any one 
complete cycle by the use of the section data as it be- 
comes available. Listed below are equations which 
describe such a method. 

T ave at the start of a four-section cycle is 

T„.(o) = 4 £ T,„(o) 

4 k =i 

during the first section computation 
1 4 1 

Tave(0 ~ ~~ ^2 Twk(°) 4 ~~ \_T W] if) ^4n(4)]> 

4 1 4 

during the second section 

1 4 1 

T m (t) = — £ T Wk (o) + — [r.,(4) - r„»] 

4 1 4 

+ 4 [r„w - r„(#)] ; 

4 

and the third section 

t„. = 4 £ r„,(») + 4 [r*,(4) - z^M] 

4 1 4 

+ ~ [Tw 2 ( 4) — T W2 (o)] + — [T W JJ) — T W3 (o)\ 

and finally, during the fourth section computation the 
average becomes 

1 4 1 

r.ve(<) = -7 £ T Wk (0 ) + — [7^(4) - T Wl (o)] 

4 1 4 

+ 4 [r„(4) - r„»] + 4 [r„,(4) - r„»] 

+ — \T Wi (t) ~ T.njo)]. 

4 
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At the end of the first complete cycle of computation the 
r ave signal is 

r„«( 4) = -f £ r„,(4) 

4 *_ i 

and so, during the next cycle, the identical process can 
be repeated. 

A circuit to accomplish this task is shown in Fig. 5. 
The expressions, indicated in terms of temperature, are 
the analogs of the voltages which, of course, actually oc- 
cur. The state of the circuit is that which would exist at 
the start of the first section computation of the first 
cycle. 

The operation of the circuit proceeds as follows. Prior 
to time zero, 

1 4 

~ ^22 T Wk {°) 

4 k = i 

is stored on capacitor C\. Relay T ave is de-energized and 
T Wl {t) — T Wl (o ) so that the output of summer P is also 

4 £ T wi (o). 

4 i 

The analog voltage of this term is applied to capacitor 
C 2 through the NC X contact on relay T ave . The inputs of 
summer Q are -\-T Wl (t) and — T Wl (o) which are obtained 
from the output of integrator B and the gain and phase 
inverter following delay D B , respectively. Integrator B 
and delay D B are shown in Fig. 4. During the first sec- 
tion computation (computers set to Operate) T Wl (t ) 
begins to differ from T Wl (o). This difference is computed 
by summer Q and added to the original summation 
stored on C\ by summer P after being attenuated by \/n 
by the input potentiometer shown. This new voltage is 
applied to capacitor C 2 . 

At the end of the first section computation, relay T ave 
is energized and maintained up during the second sec- 
tion period. Now capacitor C 2 “remembers” the initial 
voltage and the new sum consisting of 

— £ T wt (o) 

n 1 

+ 4 [r„,(4) - r„,( 0 )] + 4 \t,M - r„»] 
4 4 

is applied to capacitor C 2 through the NO x contact. Dur- 
ing this period, the inputs to summer Q are T Wi (t) and 
T W2 (o). Relay F ave is thus alternately de-energized and 
energized until all four sections have been computed. At 
the end of four periods capacitor C 2 has the analog of 

4 £ T. t (4) 

4 fc= i 

stored upon it. Summer R corrects the stored signals for 
attenuation and dc shift suffered in passing through the 
cathode-follower read-out circuit. The succeeding cycles 
proceed as the first. 



CATHODE FOLLOWING OUTPUT GAIN AND D.C RESTORER 


Fig. 5 — -Summation circuit to approximate the average 
coolant temperature. 

C. Time and Space Dependent Forcing Functions 

Provisions for forcing functions which are both time 
and space dependent require another novel time-sharing 
circuit. The heat flux input to the reactor core is a typi- 
cal example. 

By finite differencing techniques the &th section heat 
flux input can be approximated by 

qk(t) = m k q T (t), 

where m k is constant. Several system variables cause 
time variations in the reactor core— heat flux, [gr(/)], 
e.g., changes in rod position, coolant temperature, and 
pressure. If the sampling period of the time-sharing com- 
puter is chosen so that the core heat flux, qr(t), changes 
appreciably during the period, provision must be made 
to include such variations in the computation. 

Fig. 6 is a circuit diagram of a heat flux circuit which 
provides both a space and time variant forcing function. 
During the first sampling period relay Q is inoperative 
permitting the qrit) signal to flow to the input bus of the 
m k scaling potentiometers and to the input of discon- 
tinuous delay Dq (this delay could also be of the con- 
tinuous type, e.g., magnetic tape). While the qr signal 
flows, the Dq delay is stepped causing five (arbitrary 
number) samples to be stored within the delay. Poten- 
tiometer m x scales q T (t ) yielding qi(t). During this time, 
rotary switch R Q is in position 1 . Thus, qi(t) appears at 
the output of the heat flux circuit. At the start of the 
second section computation relay Q is operated and 
switch Rq is stepped to position 2 by pulsing the step Rq 
lead. During the second interval delay Dq is stepped pe- 
riodically causing the initial <Ze(0 signal to reappear on 
the potentiometer input bus as well as at the input of 
the delay. Smoothing and gain are applied to the discon- 
tinuous signal as indicated. During this period g 2 (0 ap- 
pears as the output of the flux circuit. This sequence is 
continued until all n sections have been computed. At 
the completion of the computational cycle rotary switch 
Rq is set to position 1 by pulsing the home Rq lead which 
actuates the release magnet. Relay Q is released and the 
circuit is ready for the second sampling period. This cir- 
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cuit performs the functions of gates A\ through A 4 of 
Fig. 4. 

D. Time Dependent Forcing Functions 

Another class of forcing functions which must be 
handled by time-sharing are those which are time de- 
pendent only. An example of this class is the coolant 
flow rate through the reactor core. For nonconstant flow 
(1) and (2) are amended to read: 

JAA = cMt) - ( 8 ) 

dt 

dT Wk (t ) r , 

— W = aj /(<)»- 8 [r mi (<) - 

dt 

- /(<)[r„,(<) - r { ,(0]. (?) 

Clearly those terms in (8) and (9) with coefficients 
ce 2 , a s , and a± are dependent upon flow rate. Fig. 7 is a 
circuit designed to include the effects of variable flow 
upon the reactor core analog simulation. The circuit is 
shown as it would appear when augmenting the time- 
sharing simulator illustrated in Fig. 4. Only integrators 
A and B of Fig. 4 are indicated and all other components 
are omitted for simplicity. The variable flow portion is 
set off by the heavy broken line in Fig. 7. At the start 
of the transient run relay F is de-energized allowing the 
flow signal f(t) to pass into the computer as a continu- 
ous function. During the initial sampling period, delay 
D f is stepped causing discrete samples of f(t) to be 
stored in the delay (Df could be a continuous delay, 
e.g., tape). The sample f(t) also passes to multiplier M B , 
and the function generator FG1 and hence to multiplier 
M a . The outputs of these multipliers, Ma and Mb, are 
’"- desired functions 


Fig. 7 — Time variant flow rate forcing function circuit for a time- 
sharing computer solving the reactor core equations. 

and 

f(t)[T n (t) - T,M, 

respectively. At the end of the section 1 computational 
period, relay F is energized by applying voltage to the 
Operate F gate lead. During the section 2 and succeed- 
ing section computation periods the discontinuous delay 
Df is stepped periodically causing the initial f(t) sample 
to pass out of the delay, through the smoothing and gain 
circuit, and into the computing circuit. Thus, the sample 
is reused in each section period. At the conclusion of the 
complete w-section computing cycle, relay F is de- 
energized and the circuit is prepared to receive the 
second f(t ) sample from the main computer. 

III. Pilot Model 

In order to determine the workability and accuracy of 
the time-sharing computing method a pilot model was 
designed and constructed with sufficient capacity to 
solve a four-section reactor core heat transfer problem 
with constant coolant flow and uniform axial heat flux. 

The necessary delay circuits for the pilot model were 
designed by an extension of an invention attributed to 
Janssen 2 and later demonstrated by Philbrick. 3 A block 
diagram of a delay circuit is shown in Fig. 8. Buffer 
amplifiers B\, Bz, etc., have the following properties: 

1) very high input impedance, 

2) very low output impedance, 

3) amplification close to unity, and 

4) high available output power. 

2 J. M. L. Janssen, “Discontinuous low-frequency delay line with 
continuously variable delay,” Nature, vol. 169, p. 148; January, 1952. 

3 “A Palimpset on the Electronic Analog Art,” ed. by H. M. 
Paynter, G. A. Philbrick Researches, Inc., Boston, Mass., p. 163; 
1955. 
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Fig. 8 — Block diagram of a discontinuous delay line with continuous 
variable delay (after J. M. L. Janssen, Royal Dutch/Shell Labo- 
ratory, Delft, Netherlands, October 25, 1951). 

These devices were obtained by the design of an extra- 
linear cathode follower. Switches Si, Si, etc. have char- 
acteristics as follows: 

1) very low forward impedance, 

2) very high reverse impedance, and 

3) controllable by external command signals. 

The switches for the delay circuits of the pilot model 
were designed for two different applications of the de- 
lays: 

1) The short time delay circuits, e.g., the recycled 
forcing function delays (delay Dc of Fig. 4), re- 
quired bilateral electronic switches patterned 
after the work of Philbrick. 3 

2) The long time delay circuits, e.g., the initial condi- 
tion delays (Da and Db of Fig. 4), were designed 
with fast-acting relay contact switches. 

Capacitors Ci, Ci, etc., are extremely low-leakage com- 
ponents. Output amplifier, A ou t, has: 

1) adjustable gain, 

2) very high input impedance, and 

3) low output impedance. 

This component was obtained by cascading a buffer 
amplifier and a conventional analog computer dc ampli- 
fier. This arrangement permitted the required smooth- 
ing operation and the gain adjustment to be performed 
within the output device. 

To begin the explanation of the delay circuit it is as- 
sumed that all switches are open and all capacitors ini- 
tially uncharged. At time zero all odd number switches 
are closed for a sufficient time to cause capacitor Ci to 
charge to E(o). At time T all even number switches op- 
erate causing Ci to assume voltage E(o ). The odd 
switches are then closed at time 2 T allowing E(2T) to 


charge C\ and E(o) to pass to C3. At time 3T even 
switches are closed moving E(2T) to C2 and E(o) to C 4 . 
The process of alternately closing the odd and even 
number switches is continued with closures every T 
seconds. Eventually, after (n—\)T seconds, voltage 
E(o) appears on C n and hence becomes the first output 
sample. Following this voltage, every 2 T seconds, are 
E(2T), E(4T), E(6T), etc. Thus a delay of (n— 1) T is 
achieved. Since T, the switching period, can be con- 
trolled, the objective is achieved. 

The timing circuit of the pilot model was synchro- 
nized with a master clock. Clock pulses were used to 
drive bistable multivibrators which performed desired 
frequency divisions. The resulting subharmonics of the 
clock pulse train were directed to a logic circuit which 
generated the necessary control pulses to execute the 
desired sequential switching plan. The control pulses, 
after receiving power amplification, actuated relays 
whose contacts formed a switching network. The signals 
from the network controlled the operation of the com- 
ponent devices which made up the time-sharing com- 
puter. 

IV. Test Results and Conclusions 

In order to evaluate the performance and accuracy of 
the pilot model a series of tests were run in which the 
reactor core heat transfer equations were simulated 
with zero heat input and constant flow, i.e., a simple 
transport delay problem described by: 


dT Wk 

2 n 

2 n 
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' " d' v>k , 
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dt 

r do 
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The forcing function was a cosine shaped increase in the 
inlet coolant temperature. The results of these experi- 
ments indicated: 

1) Conventional and time-sharing circuits are com- 
patible and reproducible results are obtainable. 
Switching transients, relay contact “races,” and 
switching synchronism problems are evident but 
they can be overcome by proper circuit engineer- 
ing. 

2) Simulation accuracy is a function of the sampling 
interval employed in the delay circuits and the 
method of signal smoothing employed. Fig. 9 
shows a typical input-output trace. The circuit was 
forced by a 0.785 rad/second cosine rise in inlet 
coolant temperature. Delay D c of Fig. 4 was 
smoothed by a l/(r c 5 + l) filter in which the opti- 
mum t c was found to be 0.03 second. The maxi- 
mum per cent departure from the ideal delayed 
transient (also shown in Fig. 9) is 3.1 per cent oc- 
curring 3.6 seconds from the start of the transient. 

3) The accuracy of the simulation of systems in 
which feedback signals dependent upon instan- 
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Fig. 9 — Results of a time-sharing simulation of transport delay. 


Fig. 10 — Operation of the summation circuit. 


taneous spatial averages, is limited by the inher- 
ent inability of the time-sharing computer to cor- 
rectly obtain such averages. The operation of the 
circuits designed to obtain the approximate aver- 
age coolant temperature was, however, successful. 
Fig. 10 illustrates the results of approximating the 
spatial average by employing the circuitry previ- 
ously described. In this figure each r s segment of 
the input forcing function produces four output 
traces since the average water temperature is con- 
tinuously corrected as new data become available 
from the four sections in turn. The nature of the 
approximation is seen by comparing the summa- 
tion circuit output traces, shown as solid lines in 
Fig. 10, with the ideal average temperature shown 
as a dashed line. 

V. Cost and Equipment Requirements 

In order to compare the approximate cost and equip- 
ment requirements of time-sharing computation with 
conventional analog computation a calculation of these 
requirements for a large but typical problem was per- 
formed. The problem considered was the simulation of 
the heat transfer phenomena of a reactor core coupled 
to a heat exchanger within a pressurized, forced convec- 
tion system. The simulator was designed with sufficient 
capacity to provide the following: 

1) variable coolant flow, 

2) nonuniform axial heat flux, 

3) temperature coefficient of reactivity feedback, 

4) variable steam throttle opening, and 

5) steam temperature feedback in the heat ex- 
changer. 

The most important results of the calculation are 
shown in the three graphs of Fig. 11. Fig. 11 (a) compares 
the cost of time-sharing solutions to conventional cas- 
cade solutions as a function of the number of sections. 


(a) 



(b) 



(c) 



Fig. 11 — Cost and equipment comparisons. 

Time-sharing costs are divided into two parts. The 
first, shown cross-hatched in Fig. 11(a), represents the 
cost of that equipment which is peculiar to time-sharing 
and which could not readily be used in non-time-shared 
applications. The second component cost is that of 
multipurpose equipment, e.g., amplifiers, potenti- 
ometers, multipliers, etc., which, of course, could be 
used for non-time-shared problems. Fig. 11(b) and 
11(c) shows the amplifier and potentiometer require- 
ments as a function of the number of sections simulated. 

The ratio of the machine running time required to 
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solve a sectionalized problem by time-sharing to the 
running time by the conventional cascade method is 
given by: 

time-sharing running time (r s + r r) 

; ; = n • 

conventional running time r s 

VI. Conclusions 

The results of this study indicate that a time-sharing 
analog computer for the solution of differential-differ- 
ence equations is realizable and is economically attrac- 
tive in certain circumstances. If the number and size of 
multisection problems to be computed are large, the 
equipment conservation possible with time-sharing can 
conceivably outweigh the accuracy and running time 
penalties. 

The three phases of this study, viz., design, develop- 
ment, and optimization, yielded more specific conclu- 
sions. The design of a system to solve the reactor core 
and boiler heat transfer multisection equations re- 
vealed that considerable savings in operational ampli- 
fiers and potentiometers would be enjoyed when the 
number of sections simulated exceeded three. First cost 
of the auxiliary equipment required for the time-shar- 
ing solution of these equations is estimated to be equiva- 
lent to the first cost of a sufficient amount of conven- 
tional analog equipment to solve a four-section prob- 
lem. Programming and set-up time would be markedly 
reduced by time-sharing particularly if a large number 
of physical parameters and analog scaling factors were 
common to many sections. Such a savings would tend 
to offset the increase in running time required by time- 
sharing. Design considerations further revealed the 
upper bound on the accuracy obtainable under time- 
sharing to be the accuracy achieved by the conventional 
cascade analog solution. This accuracy can be ap- 
proached as 

1) the sampling interval, r s , approaches zero, 

2) the number of samples, p, obtained for the forcing 
function delays approaches infinity, and 

3) the highest frequency components of the input 
forcing functions approach zero. 

The development phase of this study produced work- 
ing models of all of the circuits essential to the time- 


sharing system. These units were successfully integrated 
with conventional analog computing equipment and 
test problems were run. The electronic and relay circuits 
proved reliable and the total system yielded reproduci- 
ble results. The delay and timing circuit designs, as con- 
ceived for the four-section pilot model, can be readily ex- 
tended to handle additional sections. 

Much additional attention can be given to the op- 
timization of the time-sharing system. The effect of the 
variation in computing parameters, e.g., r s , upon accu- 
racy might profitably be studied. Newly available analog 
devices, such as magnetic tape loop transport delays, 
could provide improvements in computational accuracy 
and thus merit investigation. 



Appendix 

List of Symbols 


Symbol 

Description 

Units 

a i 

Constant in heat transfer equations 

ft 2 °F/Btu 

a 2 ', az, an' 

<rkrc 

Constant in heat transfer equations 

1/sec 

Reactivity due to temperature effects 

reactivity 

Kt 

Over-all temperature coefficient of reac- 
tivity 

reactivity /°F 

Krc k 

Temperature coefficient of reactivity for 
the &th section 

reactivity/°F 

m 

Ratio of instantaneous coolant flow rate 
to the time-zero flow rate 

dimensionless 

m k 

Ratio of &th section average heat flux to 
the total average heat flux 

dimensionless 

n 

Number of axial sections 

dimensionless 

P 

Number of samples obtained for delays 
during r K interval 

dimensionless 

5* 

Spatially averaged heat flux in the &th 
section 

Btu /second 
ft 2 

q T 

Spatially averaged total heat flux 

Btu/second 

ft 2 

t 

Time 

second 

T 

Switching period of discontinuous delay 

second 

T C 

Smoothing circuit time constant 

second 

T do 

Coolant transport time through reactor 
core coolant channel 

second 

Tdh 

Hot leg transport time, core to heat ex- 
changer 

second 

tr 

Time-sharing reset time 

second 

T» 

Time-sharing sampling time 

second 

-lave 

Spatially averaged mean coolant temper- 
ature 

°F 

Ti k 

Mean coolant temperature at entrance to 
the kth section 

°F 

Tm k 

Spatially averaged metal temperature in 
&th section 

°F 

To k 

Mean coolant temperature at exit of the 
&th section 

°F 


Spatially averaged mean coolant temper- 
ature in kth. section 

°F 
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Computers — The Answer to Real-Time Flight Analysis 

GUENTHER HINTZEf 


Introduction 

T HE development of real-time data processing fa- 
cilities has become an important task for the mis- 
sile test ranges. The requirement for efficient test 
programs and the desire of the missile developers to have 
the results of missile firings “as soon as possible” after 
the flight, are pressing and many studies and develop- 
ments for improved test facilities with faster data proc- 
essing are being undertaken. These developments can 
be broken down into two major categories: instrumenta- 
tion and computers for the reduction and analysis of 
the measured data. As long as the data are only re- 
quired “very fast,” say one or a few hours after the 
flight, the main problem is the data collection, trans- 
mission, and conversion to make them in the proper 
forms available to high-speed computers for which the 
further reduction and print-out presents no particular 
difficulty if enough computer capacity and peripheral 
equipment is available. 

However, the computational problem is of a different 
order for real-time data analysis whereby, based on this 
analysis, actions might be taken while the flight is still 
in progress in order to obtain conclusive information or 
to influence the flight experiment in such a way that the 
accomplishment of desired objectives might be secured. 
Here is a new, challenging task for electronic computers 
leading to the development of new concepts and the im- 
provement of existing computers. Ground guidance 
computers which have been developed for several guided 
missile systems perform a similar job, directing a mis- 
sile according to real-time data which are being con- 
tinuously observed during the flight. There is, however, 
a main difference : while these computers have to direct 
a well or nearly well behaving missile, a fully developed 
real-time flight analysis and control facility is mainly 
called into action when things are not going well and 
something has to be done to save a costly flight test. 
It is evident that this is a formidable task and in this 
paper some ideas will be presented concerning the order 
of magnitude of the required computing equipment. 

The Basic Concept of Real-Time 
Flight Analysis 

Fig. 1 shows a simplified diagram of the real-time 
flight analysis and control concept. Simultaneously 
with the actual flight of the missile, the flight is also 
simulated on a missile simulator where initial conditions 
and parameters are set exactly in accordance with the 
real missile. Through real-time range instrumentation, 

f Flight Simulation Lab., White Sands Missile Range, Las Cruces, 
N. Mex. 



parameter presetiinys 

Fig. 1 — Simplified diagram for real-time flight analysis. 


including real-time data collection, transmission, and 
reduction, a ground computer receives continuously or 
in sampled form inputs, concerning the motion of and 
the events within the missile. In the diagram this- 
computer is called flight analysis and control device. 
In order to make the simulated-flight compatible, the 
actual real-time driving and control functions are sent 
to the missile simulator according to the measurements 
which are obtained from the flight. 

The data describing the response and performance of 
the simulated missile and its components are compared 
in the flight analysis computer with the actual flight 
data. As long as these values compare within certain 
predetermined and specified margins, no actions may be 
taken. If deviations develop which are still not critical 
but which warrant some investigation about the reasons 
for the deviation, diagnostic commands may be relayed 
to the missile switching over from some standard meas- 
urements in noncritical areas to additional measuring: 
pickups which had been previously installed in sus- 
pected trouble areas. Sound engineering judgment is 
required to place such additional information sources. 
Case histories of previous flights and test experience 
will help in this regard. For example, a common 
trouble source are accelerometers which are used ac- 





Hintze: Computers — The Answer to Real-Time Flight Analysis 


351 


tively in guidance and control loops, and it is indicated 
to mount close to these accelerometers vibration pick- 
ups which can be switched in to determine quantita- 
tively and timely the rise and propagation of vibrations 
which might cause erratic accelerometer outputs. 

When failures occur which disable the missile to con- 
tinue its flight properly and which will lead to aborted 
rounds, remedial commands to the missile shall be insti- 
tuted. The analysis scheme must be detailed enough to 
decide if these commands should merely isolate a failing 
component, or send substitute command signals to the 
missile. For the future, it is even visualized to have some 
reserve jatos built in test missiles which could be used 
to cause a change in velocity from the ground. 

It will require many diversified developments to ad- 
vance real-time flight analysis and control to the point 
where it will be a workable and reliable facility which 
does not add to the difficulties of a timely execution of 
flight test programs, but which will be a valuable help 
for better understanding and utilization of missile 
flight experiments. It is felt that the part which com- 
puters will play in the accomplishment of such facilities 
is decisive, particularly if the test engineers and systems 
analysts succeed in formulating precisely the mathe- 
matical relations on which the analysis and the making 
of decisions is based. 

Mathematical Formulation of the Problem 

The description of the tested guided missile system in 
form of a mathematical model is the necessary prerequi- 
site for a further analysis of this system and the meas- 
ured flight data on computers. The mathematical 
models which are presently used for flight simulation 
studies are deterministic or rigid analytical models 
where the input-output relations for missile motion and 
individual components are expressed in mechanistic 
equations. Fig. 2 showstherigid enalytical representation 
of the following portions of a missile system: the missile 
dynamics which determine the three vectors of lateral 
and angular accelerations as functions of force and 
moment inputs; the direction cosine matrix for trans- 
formation from missile body to radar axes using the 
angular rates of the missile body axes as input; the 
ground guidance computer which receives the rectangu- 
lar missile coordinates in the radar frame; and finally, 
the control subsystem of the missile causing fin-deflec- 
tions according to the command input from the ground 
and the missile motion. 

This is an extremely simplified representation of a 
missile system which shall serve only the purpose to ex- 
plain that most of the inputs are not fixed values, but 
they are characterized by some type of distribution. 
Even if the simulation state of the art is such that pres- 
ently the number of stochastic inputs in simulation 
studies are quite limited, it must be borne in mind that 
a realistic simulation of a missile system which can be 
used as a reference in evaluating flight data must take 
into account these random variations of inputs. 


Some of the areas where consideration should be given 
to stochastic inputs are the airframe characteristic 
anomalies such as airfoil malalignment and wind tunnel 
variation. In case of a spinning missile, fin malalignment 
may be a contributor to miss distance as much as some 
electronic failures. Burning anomalies affect the missile 
mass and inertia. Airframe flexure failure modes limit 
the required missile dynamics. Radar noise measured 
under one set of atmospheric conditions may be quite 
different from that measured under a changed condi- 
tion. Component production variation effects, the mal- 
alignment of sensing elements, and the equipment re- 
sponse variations due to temperature or vibration are 
also stochastic variables. 

It would be unjustified to draw conclusions from the 
comparison of flight data with some fixed value where in 
reality there is not such a thing as fixed values. The 
analysis must consider the distribution of these values 
and ultimately a probabilistic simulation model should 
be used as reference for real-time analysis. However, the 
development of such missile models is not yet com- 
pleted. Great progress has been made in the realistic 
simulation of the random noise of track radars, but for 
many other stochastic missile variables, the amplitude 
distribution and frequency characteristics have not 
been determined. Therefore, in the following discussion, 
reference is made to existing, rigid analytical models, 
though the ultimate requirement for probabilistic an- 
alytical models is being recognized. 

Computer Requirements 

The computer requirements for conducting the real- 
time flight analysis are dictated principally by the de- 
gree of realism in the guided missile system simulation, 
and the complexity of the analysis schemes to be used. 

Computer Requirements as a Function of Realistic Simu- 
lation 

The simulation according to Fig. 3 when carried out 
in full without simplification will represent a guided 
missile system in 3 dimensions and 6° of freedom as it 
operates in rotating, oblate spheroidal earth, with ex- 
ternal meteorological wind, gravitational attraction 
which varies according to latitude and missile altitude, 
and electromagnetic noise caused by track radars. 

A great amount of computing equipment is required 
for the various coordinate transformations. If the aero- 
dynamic forces and moments are not given in missile 
body axes, they have to be converted into these direc- 
tions by use of a direction cosine matrix which is a 
function of the components of the missile velocity with 
respect to the air. 

Since missile velocity is with reference to the body 
frame, the external wind velocity, expressed in the local 
ground frame, needs to be first transferred to a form re- 
ferred to the body frame. Then this wind velocity is ex- 
pressed in body frame components by use of a direction 
cosines matrix. From the missile rotational velocities, 




Fig. 2. 
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Fig. 3 — Verbal diagram of a simulation of the guided missile system. 
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the earth rotational velocity, and the change of missile 
latitude and longitude with respect to time, this set of 
direction cosines is developed. 

Another matrix of direction cosines is developed 
which relates the missile velocity components in the 
local ground frame to velocity components in the mis- 
sile tracking radar frame. After the missile position is 
determined in this rectangular frame, the position is 
converted into components in the spherical frame. 
These components are inputs to the missile tracking 
radar. 

Realistic simulation of thrust forces involves repre- 
sentation of starting and tail-off transients of the motor 
as well as variations in the thrust due to change in ambi- 
ent air pressure. The force due to gravity, when realisti- 
cally simulated, requires that the mass attraction vec- 
tor be computed where missile latitude and altitude 
have been considered. In computation of the transla- 
tional and rotational velocities and accelerations of the 
missile, the simulation needs to account for the change 
of mass and of inertia of the missile during its flight. 

Fig. 3, from which a few details were discussed, repre- 
sents somewhat realistically the guided missile system 
operation with the chief influencing factors. However, 
for such a simulation, about 600 amplifiers with asso- 
ciated nonlinear equipment would be required. 

A more detailed description of the missile control 
subsystem, indicated by one box in the general simula- 
tion plan, is presented in Fig. 4. Control of the missile 
is achieved in each plane by comparing the command 
voltage to the sum of three feedback voltages. 

The rate gyros and accelerometers are being simu- 
lated mathematically by transfer functions having 
second order denominators. The fin position voltages are 
seen to be functions of stagnation pressure as well as 
corresponding shaft position. The networks for each 
control channel are represented by a set of equations 
which together describe the behavior of the network. 
Then each steering amplifier and electrohydraulic as- 
sembly is expressed by a transfer function representing 
the dynamic characteristics of this link. 

Computer Requirements as a Function of Complexity of 
the Analysis Scheme 

The extent to which the complexity of the analysis 
scheme influences computer requirements may be ob- 
served from discussion of Fig. 5. This figure outlines 
briefly in diagram form the general form of a real-time 
flight analysis setup. 

In this scheme, it is seen that the actual guided mis- 
sile system acts as the input source. Quantities repre- 
sentative of accelerometer outputs, rate gyro outputs, 
missile altitude, and missile position are transmitted by 
telemetry or other means to a computer which uses these 
data to determine missile angular velocities, angular ac- 
celerations, aerodynamic moments, translational ac- 
celerations and velocities, angles of attack, and aero- 
dynamic forces. If angles of attack are not measured 


directly, external wind velocity as a function of loca- 
tion and altitude is needed in this computation. By use 
of this information, the position of the missile may be 
calculated, then compared to missile position as given 
by the missile tracking radar. If the results reasonably 
agree, according to errors allowable to data processing 
and telemetry, then the information, derived by process- 
ing the telemetered accelerometer, rate gyro, and alti- 
tude information, may be considered as useful in con- 
ducting an analysis of the actual guided missile system. 

Telemetered information concerning voltages at vari- 
ous points in the missile control subsystems is conveyed 
along with the accelerometer and rate gyro information 
to a simulation of the missile control subsystems. Mal- 
functioning within a certain block of real missile equip- 
ment may then be determined by making the input to 
corresponding simulated block exactly equal to the 
block input in the actual missile. If the output of the 
simulated block is different from the real system block, 
then the real system block has malfunctioned. (Of 
course the assumption is made that the simulated sys- 
tem is quite validly represented.) At the same time, the 
difference between the unequal outputs may be deter- 
mined in order to calculate an additional input to the 
real system causing it to perform as desired. For ex- 
ample, if the corresponding commands to the simulated 
missile and to the real missile are identical, but the cor- 
responding voltages at the outputs of the receivers are 
greatly different, then it can be suspected that the re- 
ceiving and translating circuits have malfunctioned. 

Another section of the computer facility in the real- 
time analysis setup would calculate the aerodynamic 
force and moment coefficients starting with the use of 
theoretical expressions for the coefficients and the actual 
values for fin positions, angles of attack, dynamic pres- 
sure, and missile velocity. If these calculated values of 
the forces and moments are much different from the cor- 
responding values as determined from rate gyro and 
accelerometer information, then commands would be 
generated which would force the real missile to respond 
only in the pitch or yaw plane or in some restricted 
fashion. Thus the complexity of the aerodynamic co- 
efficient expressions would be greatly reduced and calcu- 
lation of corrected partial derivatives would be simpli- 
fied. 

The mathematical relations as well as the required in- 
puts and the produced outputs of the optimization 
scheme for the aerodynamic coefficients are shown in 
more detail in Fig. 6. For the computations which have 
to be performed for the coefficient optimization, it is 
assumed that sufficient different numerical values of the 
measured quantities are supplied for the determination 
of the wanted coefficients. These values have to be meas- 
ured and processed in a time short enough that during 
this interval other variables, such as Mach number, are 
practically constant. 

The technique of making the missile respond to pre- 
programmed commands is already being applied in the 
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Fig. 4 — Missile control system block diagram. 
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Fig. 5 — Simplified presentation of analysis scheme. 
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Fig. 6 — Optimization of aerodynamic force and moment coefficients. 


so-called aerodynamic runs in which guided missiles are 
flown with inactive guidance systems. However, this 
method does not exercise a closer control of the missile, 
whereby certain variables such as bank angle and unde- 
sired motion in several planes are kept to a minimum. 
Provisions are made that according to the motion of the 
missile, commands can be relayed to the missile keeping 
it in the motion which is desired for the check on its 
aerodynamic characteristics. If in one particular phase 
of the flight the lift coefficient shall be investigated, the 
missile is kept from lateral motion and rolling. For a 
small range of Mach number, the lift coefficient then 
becomes only a function of fin deflection and angle of 
attack. With accurate measurements of these two pa- 
rameters, and the aerodynamic force by use of acceler- 
ometers, it is possible to obtain for this particular phase 
of the test good information on the lift coefficient. Other 
phases in the test can be used in a similar way for the 
determination of other force and moment coefficients. 

The foregoing discussion may convey some idea of 
the complexity of a real-time flight analysis computer 
setup. Considering the processes of comparison, and 
generation of forcing command, as well as circuits for 
automatic optimization of the simulated coefficients, it 
is reasonable to estimate the analog computer require- 
ments for the scheme discussed at about 700 amplifiers 


with the usual proportion of associated nonlinear equip- 
ment. 


Conclusion 

It is felt that present-day computers could handle 
real-time flight analysis on a somewhat limited basis, 
either switching in automatically additional informa- 
tion channels as required, or performing a comparing 
analysis on a few variables at a time. However, a full- 
fledged real-time analysis giving a comprehensive, quan- 
titative evaluation, adequate to understand the “how” 
and “why” of all flight events, may exceed present com- 
puter capability. 

The described analysis scheme, which is one of vari- 
ous possibilities, requires approximately 1300 amplifiers 
and proportional nonlinear equipment. The mechani- 
zation and solution of a problem of this size is feasible 
with existing analog computers, if enough time is avail- 
able to make the proper checks and adjustments before 
a run is made. 

However, when tied up with actual missile firings, no 
additional time can be permitted for the readying of the 
computers, because there are already too many other 
delays caused by instrumentation which aggravate the 
flight experiment of missiles. 

There seem to be two possible answers to this prob- 
lem. One is that digital computers be large enough to 
solve in real time a set of equations of the indicated 
order of magnitude. The problem of having digital com- 
puters always ready on a standby basis could be more 
easily solved. The second possibility would be the 
addition of automatic checking and adjustment fea- 
tures to the analogs working reliably enough to have an 
accurate performance of the analog computer assured 
every time a missile is ready to go. The combination of 
analog and digital computers may offer some promise. 

Although the outlined computer requirements may 
appear formidable, the benefits to be derived from these 
computer capabilities make the effort to be expended 
worthwhile. The achievement of real-time flight analy- 
sis for conclusive evaluation of missile flights and for the 
control of these expensive and complex experiments will 
bring an advance to all guided missile programs which 
is so significant that it should be given fullest attention. 
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Industry’s Role in Supporting High-School 

Science Programs 

J. 0. PAIVINENf, Chairman 


A N introductory statement was presented by the 
chairman, a member of the WJCC Technical 
Program Committee. He pointed out that the ob- 
jective of the meeting was to bring information on co- 
operative industry-school programs to the attention of 
interested computer people as well as invited high-school 
principals from the Bay area (20 out of 100 invitees at- 
tended). Through this meeting, the computer industry 
expressed interest and willingness to cooperate in addi- 
tional similar programs wherever the need may be evi- 
denced by schools or teachers. It is not the intent to re- 
strict such programs to involve computer technology 
only, but to participate in broad science programs in- 
cluding a computer portion. In this way, students aim- 
ing at careers in science will be exposed to computer 
techniques as one of their available tools, while poten- 
tial computer people will have attained a broader grasp 
of basic technology. 

Dr. Paul Hurd, Chairman ( Professor of Science Educa- 
tion at Stanford University , Calif.) 

Dr. Hurd portrayed the program as illustrating ex- 
amples of cooperation between industry, professional 
people, and civic groups, working with high schools to 
enrich the technical exposure of high-school students. 
The needs of our society have expanded from one in 300 
employees in 1900 needing a scientific, engineering, or 
mathematical training for their positions to one in 37 
needing such training in 1958. The trend is still in- 
creasing for such training. Conversely, schools need the 
help of industry and civic groups to establish such pro- 
grams not only to avoid increasing the teacher’s burdens 
but to bring to the teachers a knowledge and back- 
ground in newer areas of technology and science. Ex- 
amples in the program will point out that the success of 
cooperative programs depends always on both an in- 
terested teacher and interested community members to 
participate in the extra work. In summary, the examples 
in the program show how a new dimension can be 
brought to education in which the community itself 
takes an active part by contributing time, technical 
skills, and support to enrich high-school and junior 
high-school programs. 


t General Elec. Co., Palo Alto, Calif. 


Darryl Littlefield ( Physics and Science Teacher, Liver- 
more Union High School, Livermore, Calif.) 

An elective course is offered as part of the curriculum 
at the high school to cover applications and program- 
ming of the IBM 650. The objective of the course is to 
convey a real appreciation of the power and applica- 
tion of computers in modern research, engineering, and 
business. Problems are generally of a nature that would 
not normally be encountered by high-school students. 
Littlefield and Herman Thomas of the Math Depart- 
ment cooperate in the project. 

The course stemmed from Littlefield’s contact with 
the 650 at the Lawrence Radiation Laboratory (Uni- 
versity of California) at Livermore and his consequent 
interest toward using the community resources to es- 
tablish programs of real educational growth for his 
students. Dr. Sidney Fernbach of the Radiation Lab- 
oratory acts as consultant for the course while the Lab- 
oratory makes available machine time for use by the 
students. The Radiation Laboratory has also made 
available charts, models, and speakers for the class- 
room. 

The benefit of a week-long seminar on computer in- 
struction for teachers and engineers, arranged by Dr. 
Van Etta of Hughes Aircraft Co. of Los Angeles, was 
stressed. As another part of the school program, the 
construction of a relay binary adder-subtractor by two 
students using parts donated by Pacific Telephone and 
and Telegraph and IBM was mentioned. Dr. LaFrangi 
of the Radiation Laboratory acted as advisor. 

Joanne Watkins ( Senior Student, Livermore High 
School, Livermore, Calif.) 

Programming to solve the motion of a projectile in 
vacuum was described and flow charts, coding sheets, 
and plots of trajectories for varying initial elevations 
were shown. The course was enjoyable because it was 
new and interesting and has given a background ap- 
plicable even to other computers, together with an ap- 
preciation for what jobs can best be done on computers. 

Doug McMilin ( Junior Student, Livermore High School, 
Livermore, Calif.) 

A payroll calculation on the 650 was described and 
the input and output card layouts, flow charts, and 
programming sheets shown. The main points of the 
calculations were described. Personal access to a ma- 
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chine was found fascinating and mathematical principles 
in practical use enjoyed. The training in logical thinking 
was valuable and resulted from working a problem as a 
whole, with simultaneous attention to detail, while ob- 
serving the strict rules of computer programming. 

Tom Doyen and Ross Harrower ( Students , Livermore 
High School, Livermore, Calif.) 

The operation of the binary adder they had con- 
structed was explained, including mention of subtrac- 
tion by complements. A demonstration followed the 
description. 

Dr. Sidney Fernbach ( Staff Member, Lawrence Radia- 
tion Laboratory, Livermore, Calif.) 

Other cooperative programs supported by the Radia- 
tion Laboratory were described. A high-school commit- 
tee attempts to get students interested in individual 
projects, providing advisers on such topics as rocketry 
and nuclear energy. Speakers are also provided for sci- 
ence clubs and classrooms, and student tours are ar- 
ranged at both Berkeley and Livermore Laboratories. 
Training of high-school teachers in modern physics is ar- 
ranged through summertime and parttime employment 
where three lectures per week are provided on com- 
puters, modern physics, and chemistry. 

High-school students have been employed during the 
summer to convey a working knowledge of laboratory 
procedures; this summer, participation requiring even 
Q clearance will be arranged. 

Finally, general scholarship is encouraged by trying 
to increase student enrollment in college preparatory 
courses and by arranging for awards from the commu- 
nity for outstanding students. 

Henry Martin ( Physics Teacher, Palo Alto High School , 
Palo Alto, Calif.) 

Five to six years ago, industry help was sought in 
guiding the efforts of students who evidenced their 
eagerness to learn by after-hours and Saturday use of 
school facilities and the laboratory. However, little 
success was encountered due to the lack of any concrete 
program that Martin and the short-handed science staff 
had time to generate. This stalemate was broken in 1957 
by a consultant from the Joe Berg Foundation, 1712 
South Michigan Avenue, Chicago 16, 111., an organiza- 
tion that volunteers help in establishing an initial rela- 
tion between industry and schools. This resulted in the 
Palo Alto Science Seminar, which is an entirely locally 
conducted program. Fifty- two weekly sessions are con- 
ducted throughout the year, with 1 to 1| hours of each 
session devoted to a general program given by an in- 
dustry expert or a panel discussion with subsequent 
group discussions specializing in chemistry, geology, 
biology, physics, engineering, and mathematics. A stu- 
dent joins at about the tenth grade level and will ulti- 


mately undertake a project of interest to him with 
the counselling available of a volunteer industry pro- 
fessional. Industry also provides equipment, speakers, 
demonstrations, and sometimes facilities where the 
student often works in proximity to an engineer or a 
scientist. Advantages to the students include familiar- 
ization with work conditions and opportunities, oppor- 
tunity for individual creative study (since projects are 
not group efforts) , personal satisfaction of a hobby with 
prestige value, and the opportunity to seek and earn 
scholarships. 

Advantages to industry include an early encouraging 
hand to potential Ph.D. scientists, an opportunity to 
demonstrate to a broad slice of the community that 
scientists are normal human beings with families and a 
sense of humor, and an opportunity to contribute to the 
community and the future of our country. 

Other programs recommended in closing for indus- 
try’s consideration are: 

1) Participation of students as well as teachers at 
professional dinners and educational programs ar- 
ranged by industry. 

2) The assignment to some member of a company 
management team the specific responsibility of 
support and cooperation in educational affairs. 

3) The opportunity for students on field trips to 
spend sufficient time with the engineers and sci- 
entists to gain some insight into the significance of 
the projects or laboratories visited. 

4) Summer jobs for students which will increase their 
knowledge of industry practices and expectations. 

5) Screening and testing programs to select out- 
standing students to receive substantial scholar- 
ships. 

6) Summer schools at local universities so that 
students can see what they will be up against in 
the future. 

7) Public competitions for scholarship so that recog- 
nition could be given in a manner comparable to 
school athletic letters and Father’s Club dinners 
for athletes. 

Larry Hubbart ( Student , Palo Alto High School, Palo 
Alto, Calif.) 

Hubbart’s construction of a test stand as well as his 
subsequent experiments in measuring the lift of rotating 
airfoils (similar to an inverted pie plate) were described. 
An engineer from Hiller Helicopter Co. acts as adviser: 
his contribution was described as helping to suggest 
directions of investigation to pursue as well as to help 
maintain morale when the project appears to bog down. 
The test stand consisted of a counter-balanced scale 
with a drive motor and a photocell rpm counter. Air- 
foils of varying shapes and textures (some with added 
ducts) have been measured. The project was described 
as representing a success “even if it never leaves the 
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ground” due to the experience it provided in how to con- 
duct an experiment and how to present the results. 

Mike Macauley ( Student , Palo Alto High School , Palo 
Alto , Calif.) 

An experiment in combatting muscle fatigue by in- 
jected chemical solutions is being conducted. A rabbit 
is anesthetized and muscle contractions are caused by 
mild electric shock until exhaustion occurs. Injection of 
a mild hydrogen peroxide solution results in acceleration 
of the recovery: recovery periods range from 3 minutes 
for a 3 per cent solution to 0.3 second for 15 per cent. 
Hydrogen peroxide was chosen due to the safe decom- 
position products; procedures are followed to safeguard 
against bubble formation in the blood. 

The insights gained into medicine and medical re- 
search have reinforced interest in entering the field of 
medicine. 

Wallace Burton ( Engineer , R-S Electronics , Palo Alto , 
Calif.) 

The YMCA Men’s Club, in seeking to further en- 
courage science students and to provide recognition for 
work done in the Palo Alto Science Seminar, chose to 
present an annual Palo Alto Industry-Youth Science 
Show. The show presents awards to outstanding student 
science projects and also presents an opportunity for ex- 
hibits by local industrial firms. 

Announcements of the show with award categories 
are sent to all the high schools in the Palo Alto district 
and a participation of about 10 per cent of the students 
is experienced (the only other such event, the Bay Area 
Science Fair, accommodates only 1 per cent of the Palo 
Alto students). Three categories of participation are 
provided: 1) physical sciences, 2) biological sciences, and 
3) technical reports (given orally in competition). About 


one-half of the industrial firms contacted responded 
favorably to requests for financial support and indus- 
trial exhibits. In the first year, industry met $650 of the 
$850 expenditures, and this year the show will be self- 
supporting together with an expected elimination of the 
club deficit. 

The Science Show provides wide recognition to 
students through substantial adult attendance as well 
as the interest and awareness of the industry people who 
work as judges and exhibitors for the show. The Men’s 
Club provides the planning and organizing for the show 
so that the participation of the various schools can be 
drawn together into an integrated plan, this having been 
found necessary to obtain effective support from in- 
dustry. 

Even smaller communities can establish Science 
Shows. Community members are sure to lend help if 
given an opportunity while local commercial business- 
men would undoubtedly help support the expenses. 

A member of the audience (Staff Member , Systems De- 
velopment Corp., Los Angeles , Calif.) 

Educational support activities performed by the 
staff at Systems Development Corporation were de- 
scribed. At the junior high-school level, package lec- 
tures are available in 1) data processing, 2) automatic 
feedback, 3) binary and octal notation in programming, 
etc. For the high-school level, talks are available in 1) 
concepts of programming, 2) environmental simulation 
in air defense, etc. City colleges have available 1) com- 
puter design philosophies, 2) logic design, 3) comput- 
er system design, etc. A symposium has been held for 
teachers in “Implications of the Computer Age for 
Teaching Mathematics” while other lectures and spe- 
cial courses on programming and advanced math for 
high-school students have also been provided. 



